Gene Arth_3306 details

Gene Information

Locus tag: Arth_3306
Symbol:
ID: 4444000
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3712162
End bp: 3713328
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 66%
IMG OID: 639691130
Product: L-rhamnose isomerase
Protein accession: YP_832782
Protein GI: 116671849
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4952] Predicted sugar isomerase
TIGRFAM ID: [TIGR02635] L-rhamnose isomerase, Streptomyces subtype
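
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3712162
end_bp = 3713328
gene_length_bp = 1167
protein_length_aa = 388

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose last codon is a stop codon,
# so the protein is one residue shorter than the codon count.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1167 389 388
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.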


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.984678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGACG TAGCAACGGC GCTGGGCAGG CTCGAGGAGC TTGCCATCGA GGTCCCTTCG 
TGGGCCTATG GAAATTCGGG TACGCGCTTC AAGGTGTTCG GCACGCCGGG CACTCCCCGG
ACCGTGCAGG AGAAGATCGC GGACGCCGCC AAAGTCCACG AACTGACGGG CCTGGCGCCC
ACCGTTGCGC TGCATATTCC GTGGGACAAG GTGGATGACT ACGCCGCACT GCGCGAGTAT
GCGGCGGGCC TGGGCGTGGG CCTGGGCACC ATCAACTCGA ACACCTTCCA GGATGACGAG
TACAAGTTCG GTTCCCTGAC GTCCTCGAGC GAATCGGTCC GTCGCCGTGC GATCGACCAC
CACCTCGAAT GCATCGACAT CATGCACGCC ACCGGCTCGC GGGACCTGAA GATCTGGCTG
GCGGACGGCA CGAATTACCC GGGCCAGGAC GACATGCGCG GCCGGCAGGA CCGCCTGGCC
GAGTCCCTGC GGGAGATCTA CGCCGGCTTG GGCGATGCCC AGCGGCTGGT GCTGGAGTAC
AAGTTCTTCG AGCCGGCTTT TTACCACACC GACGTTCCGG ACTGGGGCAC GTCCTACGCC
CAGACCCTGG CGCTGGGGGA GAAGGCGTAC GTCTGCCTCG ACACCGGCCA CCACGCCCCG
GGCACCAACA TCGAGTTCAT CGTGATGCAG CTGCTGCGCC TGGGCAAGCT GGGCTCCTTC
GACTTCAACT CGCGCTTCTA CGCGGATGAT GACCTGATTG TGGGTGCGGC GGATCCGTTC
CAGCTGTTCC GGATCATGTA TGAGGTGATC CGCGGCGGCG GGTTCGGCAA GGACTCCGGT
GTGGCGCTCA TGCTGGACCA GTGCCACAAC CTGGAGGAGA AGATTCCGGG CCAGATCCGC
TCGGTGCTCA ACGTCCAGGA AATGACGGCG CGTGCCCTGC TGGTGGACAC CGCCGCCCTG
GGCGAGGCCC AGCGCGCCGG TGACGTGCTG GCAGCCAACG CCGTCTTCAA CGACGCCTTC
TACACGGATG TCCGCCCGGC CCTGGCGGCA TGGCGTGAAT CCCGCGGCCT GCCCGCGGAC
CCGATGGCCG CCTTCAAGGC CAGCGGCTAC CAGAAACAGA TCAACGAGGA CCGCGTGGGC
GGTCACCAGG CCGGATGGGG CGCATAA
 
Protein sequence
MNDVATALGR LEELAIEVPS WAYGNSGTRF KVFGTPGTPR TVQEKIADAA KVHELTGLAP 
TVALHIPWDK VDDYAALREY AAGLGVGLGT INSNTFQDDE YKFGSLTSSS ESVRRRAIDH
HLECIDIMHA TGSRDLKIWL ADGTNYPGQD DMRGRQDRLA ESLREIYAGL GDAQRLVLEY
KFFEPAFYHT DVPDWGTSYA QTLALGEKAY VCLDTGHHAP GTNIEFIVMQ LLRLGKLGSF
DFNSRFYADD DLIVGAADPF QLFRIMYEVI RGGGFGKDSG VALMLDQCHN LEEKIPGQIR
SVLNVQEMTA RALLVDTAAL GEAQRAGDVL AANAVFNDAF YTDVRPALAA WRESRGLPAD
PMAAFKASGY QKQINEDRVG GHQAGWGA
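
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such text could be cleaned, translated, and summarised is shown below. It uses only the Python standard library; the helper names (clean, gc_fraction, translate) are hypothetical, and the amino acid string is the assignment order for NCBI translation table 11 with codons enumerated over the bases T, C, A, G.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11,
# with codons enumerated in T, C, A, G order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 30 bases of the gene sequence listed above.
fragment = clean("ATGAATGACG TAGCAACGGC GCTGGGCAGG")
print(translate(fragment))  # MNDVATALGR
```

Passing the full gene sequence through clean and translate should reproduce the listed protein sequence, and gc_fraction on the cleaned gene sequence can be compared with the GC content field of the record.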