Gene Arth_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1810 
Symbol 
ID4445674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2025958 
End bp2027124 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content63% 
IMG OID639689628 
ProductABC-type sugar transport system periplasmic component-like protein 
Protein accessionYP_831300 
Protein GI116670367 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0233725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCAAC GGAGAAACCG CTACCGGGGG AGCGCGGCAG TAGCCGTGCT TGCCTGCCTT 
GTCGCACTGA CCACGGCCTG CGGTCCCGCC CCGGCCAGCC AGACGTCACA GGCAGACTCC
GAGGCGTCAG CGGCGTTCAC GGCAAAAATC ACCGCCGACG TCGCAGCTGC AACCGCACCG
CAGACCACCA CGACCAACCC CGTCCCAGCC AGCGCATCGC TCTCCAACGG GCCCAAGAAG
ATCGTGATCA TTCCTTGTTC CATGGCCGTG GAAGGGTGCG CGCGGCCGGC CCGTTCAACG
CAGGAAGCCT CGCGGCTGCT TGGCTGGGAC GCGAGCATCG ACGACCCCGC GAGCGACAGC
ACCAAGATCT CCGCGGCCAT CCAGCGGGCA GTCTCCCGGA AAGTCGATGC CATCGCGTTG
ACGTCCATCG ACGCCGCTGC AGTTCAGGGT GACATCAAGT CGGCCAGGGA TGCAGGCATC
GCCGTCACCT GCAACATGTG CGGCAACAAG GATGACCTTT ACCAGTCGCT GATTCCAGCT
CTGGACGCAA ACAACAAAGC AGGATACCTC TTGGGGGAGT TCGCGTTCCT TGAAGCGCGC
AAGCGTTTCA ATTCAGCACC GAAATTCATC GTCCTGACCG ATCCGGAATT CGACACCGTG
AAGGCGCGCG TGAGCGGCCT CAAACAGTTC ATTGAGGACT GCAAGGCTGC CGGAGCAGGT
TGCGAGCAGG TAGCCGAGAG CTCGTTCCTG GCAGGTGAAA TCAGCACCGT CGCGCCAGGG
CGCGTTGCCC AGTTGGCCCG CAGCAACCCG CGCTACAACG TGCTGTTCGC AGGATTCGAC
GCGGCCATGC TGTTCTTCTC CCAGGGGCTA CAGCAGGCAG GGCTGGCTGA TTCCAAGAAA
GCTTTCGGAA TTTCGGTGGA CGCCGACGTG GCCAATACCG AGATGATCCG CAAGGGGGGT
TTCCAGGCGG CGTCCATCGG ATTCGCCTTC GGCCGTGCAG GTTACGGCCA GGTGGACAAT
CTCAACAGGA TCTTCAGTGG CCAGAAGCCG CAGGACCAGG GCATCACCGG CAAGCTCGTT
ACAGCGGAGA ACGCCCCGGC TTCCGGCGGC TGGGACGGAG ACTTCGACGG AGTCGCCCTT
TACAAGGGAC TGTGGAAGGT CGGCTGA
 
Protein sequence
MFQRRNRYRG SAAVAVLACL VALTTACGPA PASQTSQADS EASAAFTAKI TADVAAATAP 
QTTTTNPVPA SASLSNGPKK IVIIPCSMAV EGCARPARST QEASRLLGWD ASIDDPASDS
TKISAAIQRA VSRKVDAIAL TSIDAAAVQG DIKSARDAGI AVTCNMCGNK DDLYQSLIPA
LDANNKAGYL LGEFAFLEAR KRFNSAPKFI VLTDPEFDTV KARVSGLKQF IEDCKAAGAG
CEQVAESSFL AGEISTVAPG RVAQLARSNP RYNVLFAGFD AAMLFFSQGL QQAGLADSKK
AFGISVDADV ANTEMIRKGG FQAASIGFAF GRAGYGQVDN LNRIFSGQKP QDQGITGKLV
TAENAPASGG WDGDFDGVAL YKGLWKVG