Gene Arth_4011 details

Gene Information

Locus tag: Arth_4011
Symbol:
ID: 4447812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4527105
End bp: 4528325
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 639691842
Product: transcriptional regulator
Protein accession: YP_833486
Protein GI: 116672553
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism; [T] Signal transduction mechanisms
COG ID: [COG2508] Regulator of polyketide synthase expression
TIGRFAM ID:
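
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4527105
end_bp = 4528325

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed to be a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1221 0 406
```

Under these assumptions the result agrees with the reported gene length (1221 bp) and protein length (406 aa).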


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACCAGC AGGATGTGGA ACAGCTCGTG GAGCAGGTAG CCGTAAAGCT CGGCCGTGGA 
CTGTCGCTCG AAGATCTCGA CGGCGTTCTG CTCGCCTACA GCTCCAATCA GTCCCACGCG
GACCGGGTCC GGGTGAACTT CCTGCTCAGC AAACGCGTAC CGGCGGACGT GAAGGCGTGG
CAGCTATCGC ACGGTATCGC GACGGCGGTG CGTCCCGTCG TCGTACCCGC CAACGAGGAT
CTTGGCATGC TGGGACGCGT CTGCGTCCCG CTGCTGGTCC GCGGTTTCCG CGTCGGTTAC
CTGTGGGTGC AGCAGGACAT TGACGACCAA AGCGCGACGG CGATTCTCAC CCAGCTGCCT
GGCGTCCGGG ACGAACTCGA ACTTCTGTCC GGGCTGCTCC TCGAGTCGAA CACGGCCGAA
TCCGAGTTCC GGCGCCGCAG GGAGCAGGAG TTCCTCAGCG CCTGCCGCGG TGAAGCGAAT
GCCGTTGCCG CCGTGGCCGG CTGGAAGGAG GTGCAGGGCC GCGGCCCGTG GCAGCTTGTC
ACAGTGCTCG ACGCCGACGG CTGGGCGGAG GGATCCGACC CCATCGCCTC AACCCTGATC
CACCGGTCCT CGGCCCTGCA GGCGACCATC GGTGTGGACG CGGCGCTCTT CAGTGCCGGC
ACGGAAACCC ACGCGGTGGT CCTGTTCCGG GAATCTACCG GGCGGGCGGC CCATGCGCAG
GTCCTGGTCC ACTACCAGCT GGAACTTGCC AAGCGGTCCG GGCGGCCCGT GCACCGGATC
ATCCTTGGAA CAAGCGAAGG CTTCGCCAAG CCGCGTCAAC TGGCAGACGC CTACCGGCAG
TCCAAGCAGG CCGCGCAGGC CGCCGCAGTG GATTCCCAGC TGGGCGAGCT GGTGGATTGC
CGGGCCACCG GCGTCTACCA GCTGCTGGCC TCCGCCGGTG GCGGCGCCGG GGCCTGGGCC
GACGCCGGTT CCGTCTACTG GCGCATCCTG GAAGACCACG ATCGGAACGG TGAGCTCCTG
CCCGTGCTGG AACTCCTGTA TGACAATGAC GGTTCAGTGC AGGACGTCGC CACCAGGCTG
CATCTGCACC GGAGCAGTAT TTACAACCGG CTGGGCCGCA TCCGGCAGGT CCTTGGCGTG
GATCCGCTGA AGGGCATGGT CCGGCTCGAA CTCCATGCGG CCCTCAAGGC CCGCCGCTGG
GCAGGACGCC CACGGATTTA G
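
GC content is the fraction of bases that are G or C. The sketch below shows one way to compute it from a sequence laid out in space-separated blocks of ten, as above. The function name `gc_fraction` is hypothetical, and only the first line of the gene sequence is used to keep the example short; the full sequence would need to be pasted in to compare against the GC content reported for this gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First line of the gene sequence only (60 bases), not the whole gene.
first_line = "ATGCACCAGC AGGATGTGGA ACAGCTCGTG GAGCAGGTAG CCGTAAAGCT CGGCCGTGGA"
print(f"{gc_fraction(first_line):.0%}")  # 60%
```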
 
Protein sequence
MHQQDVEQLV EQVAVKLGRG LSLEDLDGVL LAYSSNQSHA DRVRVNFLLS KRVPADVKAW 
QLSHGIATAV RPVVVPANED LGMLGRVCVP LLVRGFRVGY LWVQQDIDDQ SATAILTQLP
GVRDELELLS GLLLESNTAE SEFRRRREQE FLSACRGEAN AVAAVAGWKE VQGRGPWQLV
TVLDADGWAE GSDPIASTLI HRSSALQATI GVDAALFSAG TETHAVVLFR ESTGRAAHAQ
VLVHYQLELA KRSGRPVHRI ILGTSEGFAK PRQLADAYRQ SKQAAQAAAV DSQLGELVDC
RATGVYQLLA SAGGGAGAWA DAGSVYWRIL EDHDRNGELL PVLELLYDND GSVQDVATRL
HLHRSSIYNR LGRIRQVLGV DPLKGMVRLE LHAALKARRW AGRPRI
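
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch in plain Python. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGCACCAGC AGGATGTGGA ACAGCTCGTG GAGCAGGTAG CCGTAAAGCT CGGCCGTGGA"
last_line = "GCAGGACGCC CACGGATTTA G"

print(translate(first_line))  # MHQQDVEQLVEQVAVKLGRG
print(translate(last_line))   # AGRPRI
```

The two outputs match the first twenty and the last six residues of the protein sequence, and the gene's final codon, TAG, is read as a stop.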