Gene Arth_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1920 
Symbol 
ID4445539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2161496 
End bp2163076 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content66% 
IMG OID639689730 
ProductABC transporter related 
Protein accessionYP_831402 
Protein GI116670469 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.528555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCCC ATAACCCGGT CTCGCCTTCC ACCGCAGCGG CGGAGACGGC ACGAGACACC 
CACGACGGCG CTTCGGAGCT GCGGCTCGAT GGCGTCACCA AGTCCTATCC TGGCGTCCAG
GCGCTGAAGG GAGTCAGCTT CAGCGTCGCC CGTGGATCCA TTCATGCACT TGCCGGGCAG
AACGGCGCAG GCAAGTCGAC ACTCGTCAAG ATTCTCTCCG GAGCGGAGTC CCCGGACAGC
GGAAGCATAC GCCTTGGCGG CGAGTTTCAG CGCTTCCGTG ATCCGATGGA CGCCCAGCGC
GCCGGGATCC ACACCATTTA CCAGGAACTA TCGCTCGTGC CGTCCCTTTC GGTGGCGGAG
AATATCTTCC TTGGCCAGCT GCCTCGACGG GCGGGGGCCT CCGTCGACTG GCAGCGCATG
CAGGCCGAAG CCCGGACCGC GCTGGACCGC GTAGGTTTCC ATCTCGATGT CCGCCGCCCC
GTGGGAAGTT ACTCCACAGC GGAACAACAG GCCGTGGAGT TGGCCAAGGC CCTGCACAAG
GACGCCCGGG TCCTGCTCCT GGATGAACCC ACCTCCACCT TGCCCTTGCC CGATGTCGAG
CGGCTGTTCA CCGTCCTCCG CTCCCTTTCG GAACAAGGCG TGACGCTGCT ATACATTTCG
CACCGCATGG ATGAGCTGTT TTCGCTCTGT GACGCCGTCA CCGTGCTGCG CGATGGCGTG
AACGCCGCCG ACCTGAAAAC GGCAAGTTCC AAGCCCGCCG ACGTCGTCAC GGCAATGGTC
GGAAAGAGCC TTGAAGGCTC CATTGCCGAT GCGGCCCTCC GCGGGGAACG CTCGCCCAGC
CTGGGGGCCG GTCCCCGGGA GAATGTCATT CTCTCTGCCC GCGACCTCTC CGAGCAGGGG
CATGTGGACA GGGTGTCGTT CGAGCTGCGT GAAGGCGAAG TGCTCGGCAT CTCCGGGCTC
ATCGGCAGCG GCCAGTCAGA ACTGGCCGGC CTCATCGCCG GTGCGAGAAG GCGGACCTCC
GGCGAGATCC GCGTCGATGG GAAAGCGGTG GACTTCCGTG CCCCCCGCGA CGCGATCCGC
CGGGGGATCG GACTGCTGCC GCAGGACCGC AAGGCAGCGG GCTTCATCCC GGACATGGGT
GTTGCAGGCA ACATCACGCT CGCCAGCCTG CCGCAATTCA GCAGGCTGAG CGTGATCCAG
TCCCGGCGGG AGCGCAGTGT GGCCGGGGAG ATGGTGGCCC GGCTCGGCAT GAAGGTATCC
GGGGTGCACC AACCGCTCAA AACGCTGAGC GGGGGAACCC AGCAGAAGGC CATCCTGGCC
CGCTGGCTTG TCCGCCAGTC GCGCATCCTC GTGTGCGACG AACCAACCCG CGGCGTGGAC
GTCGGGGCGA AGGAGGACAT GTACGAGCTC ATCCGCGAGT TCGCCCAGGC GGGAGGAACC
GTCGTGGTGG CGAGCTCGGA GATTACGGAG GCGATGATGT GCGATCGCGT CCTCGTGATG
GCGCGCGGCA AAGTCGTCGC CGAACTCGAT CACGACGACA TCGACCCCTC CGGCAACGCC
ATTCTCGAGC GCTTCGCCTG A
 
Protein sequence
MLAHNPVSPS TAAAETARDT HDGASELRLD GVTKSYPGVQ ALKGVSFSVA RGSIHALAGQ 
NGAGKSTLVK ILSGAESPDS GSIRLGGEFQ RFRDPMDAQR AGIHTIYQEL SLVPSLSVAE
NIFLGQLPRR AGASVDWQRM QAEARTALDR VGFHLDVRRP VGSYSTAEQQ AVELAKALHK
DARVLLLDEP TSTLPLPDVE RLFTVLRSLS EQGVTLLYIS HRMDELFSLC DAVTVLRDGV
NAADLKTASS KPADVVTAMV GKSLEGSIAD AALRGERSPS LGAGPRENVI LSARDLSEQG
HVDRVSFELR EGEVLGISGL IGSGQSELAG LIAGARRRTS GEIRVDGKAV DFRAPRDAIR
RGIGLLPQDR KAAGFIPDMG VAGNITLASL PQFSRLSVIQ SRRERSVAGE MVARLGMKVS
GVHQPLKTLS GGTQQKAILA RWLVRQSRIL VCDEPTRGVD VGAKEDMYEL IREFAQAGGT
VVVASSEITE AMMCDRVLVM ARGKVVAELD HDDIDPSGNA ILERFA