Gene Arth_0224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0224 
Symbol 
ID4447315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp233867 
End bp235402 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content67% 
IMG OID639688020 
ProductL-arabinose isomerase 
Protein accessionYP_829725 
Protein GI116668792 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2160] L-arabinose isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACC CCAATTACAC ATCCGCCAAC GGAACATCGC TCAGCCAGTA CGAGGTCTGG 
TTCCTCACCG GCAGCCAGCA CCTGTACGGC GAGGACGTCC TCAAACAGGT CGCAGCGCAG
TCGCAGGAGA TTGCCGACGC GTTGAACGGA TCCTCAGACG TTCCGGTCAA GGTGGTCTGG
AAGCCCGTCC TTACGGATTC GGACGCCATC CGCCGCACCG CGCTGGAAGC CAATGCCGAC
GATTCCGTGA TCGGCGTGAC GGCATGGATG CACACGTTCA GCCCGGCCAA GATGTGGATC
CAGGGCCTGG ACCTGCTGCG TAAACCGTTG TTGCACCTGC ACACCCAGGC CAACGTTGAG
CTGCCTTGGG CGGACATCGA CTTCGACTTC ATGAACCTCA ACCAGGCCGC CCACGGCGAC
CGCGAATTCG GCTACATCCA GTCCCGCCTG GGCATCCCCC GCAAGACCGT GGTGGGCCAC
GTGTCCAACC CGGAGGTCAC CCGGCAAGTG GGCGTCTGGC AGCGCGCGTC CGCCGGCTGG
GCCGCCGTCC GCACTCTGAA ACTGACCCGC TTCGGCGACA ACATGCGCAA CGTGGCCGTC
ACCGAAGGCG ACAAGACCGA GGCCGAGCTC CGCTTCGGCG TCTCTGTGAA CACCTGGTCC
GTGAATGAGC TCGCCGACGC CGTGCACGGC GCCGCGGAGT CCGACGTCGA CGCGCTCGTT
GCGGAGTACG AGCGCCTCTA CGAAGTGGTC CCCGAGTTGA AGGCCGGGGG AGCGCGGCAC
GAATCGCTGC GCTACAGCGC CCGGATCGAA CTGGGCCTGC GCAGTTTCCT CGAGGCCAAC
GGCTCGGCCG CGTTCACCAC CTCCTTCGAG GACCTGGGTG AACTGCGCCA GCTGCCCGGC
ATGGCCGTGC AGCGGCTGAT GGCGGACGGC TACGGCTTCG GCGCCGAGGG CGACTGGAAG
ACCGCCATCC TGGTCCGCGC CGCCAAAGTG ATGGGCTCTG GCCTGCCCGG CGGTGCATCA
CTAATGGAGG ACTACACCTA CCACCTCGCC CCCGGCCAGG AAAAGATCCT GGGCGCGCAC
ATGCTGGAGG TCTGCCCGTC GCTGACCGCC ACCAAGCCGC GCGTCGAGAT CCACCCGCTG
GGCATCGGCG GCAAGGAAGA CCCCGTCCGC ATGGTCTTTG ACACCGACGC CGGCCCTGGC
GTTGTAGTGG CGCTGTCCGA CATGCGCGAC CGCTTCCGCC TCGTGGCGAA CGCCGTTGAC
GTCGTGGACC TGGACGAGCC CCTGCCCAAC CTCCCGGTGG CGCGTGCGCT GTGGTCTCCG
AAGCCGGACT TCGCGACCTC CGCCGCGGCC TGGCTGACTG CCGGCGCGGC CCACCACACG
GTGCTCTCCA CCCAGGTGGG CATGGACGTG TTCGAGGACT TCGCCGAGAT CGCGAAGACC
GAGCTCCTCA CCATCGACGA GGGCACCACC ATCAGGCAGT TCAAGAAGGA ACTGAACTGG
AACGCCGCCT ACTACAGGCT GGCCGGCGGG CTCTAA
 
Protein sequence
MSNPNYTSAN GTSLSQYEVW FLTGSQHLYG EDVLKQVAAQ SQEIADALNG SSDVPVKVVW 
KPVLTDSDAI RRTALEANAD DSVIGVTAWM HTFSPAKMWI QGLDLLRKPL LHLHTQANVE
LPWADIDFDF MNLNQAAHGD REFGYIQSRL GIPRKTVVGH VSNPEVTRQV GVWQRASAGW
AAVRTLKLTR FGDNMRNVAV TEGDKTEAEL RFGVSVNTWS VNELADAVHG AAESDVDALV
AEYERLYEVV PELKAGGARH ESLRYSARIE LGLRSFLEAN GSAAFTTSFE DLGELRQLPG
MAVQRLMADG YGFGAEGDWK TAILVRAAKV MGSGLPGGAS LMEDYTYHLA PGQEKILGAH
MLEVCPSLTA TKPRVEIHPL GIGGKEDPVR MVFDTDAGPG VVVALSDMRD RFRLVANAVD
VVDLDEPLPN LPVARALWSP KPDFATSAAA WLTAGAAHHT VLSTQVGMDV FEDFAEIAKT
ELLTIDEGTT IRQFKKELNW NAAYYRLAGG L