Gene Arth_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1895 
Symbol 
ID4445584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2133045 
End bp2134400 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content62% 
IMG OID639689707 
Productmajor facilitator transporter 
Protein accessionYP_831379 
Protein GI116670446 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00895] benzoate transport 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.004098 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAT TCACGCCAAT CAAAGCCATG GCGGACGCCC GCCTTAATCG TTTTCATATC 
ATTTTGCTGC TCTGCTGCTC ATTCATCATG TTCTTCGACG GATACGACCT GATTGTCTAC
GGCTCGGTCC TGCCAACCCT AATGACGGAA TGGTCCCTCA CCCCTGATCA GGCAGGCTGG
CTCGGTAGCG CAGCCCTCAT CGGCATGATG ATCGGTGCCC TGACCCTCGG TTCCCTGGCC
GACAGGATCG GACGGCGGCC AGTGGTCGTT CACGGCACAC TGCTCTTCTC ACTGGCGGCA
ATAGCAACCG GGCTCGCTAC GACACCAGAA GCCTTCGGCG CGCTTCGGTT CCTGACCGGC
ATCTTCCTGG GCGGCGTCAT ACCCAACATC GTGTCGCTGA TGAACGAACT CGCGCCGAGG
GCCAACCGCC ACGCCCTGAC GACGATCATG CTCAGTATCT ACTCCGTCGG TTCAATCGTC
GCTACGCTCG TCGCCCTCTG GGTTCTGCCT CTTCTGGGCT GGAAACCGGT CTTCTTCCTT
AGCGGTGCAG CGCTGCTGTT CCTCCCCTTC CTGTACCGGT GGATGCCTGA GTCCATGACT
TTTCTGATGA GCCGCGGGAA GGAAACAGAG GCCCGCGCCC TGCTTCGCCG CGCAGTGCCG
ACCCAGAACC ACGAGCACGT GCACTGGACA GTTCCTGCCC CTCAGCACCG TCCGTCGGTT
TCCGCGTCTC GGCTCTTTCG AGAGGGCCGG CTCCTGGGCA CCCCGATGGT GTGGCTGAGC
TTCGGAATGT GCATGCTCAT GGTCTACGGC CTGAACACCT GGCTACCGAA AATCATGATC
GCCGGCGGCT ACGACCTGGG ATCCAGCCTG CAGTTCCTTA TCGTCCTGAA CATCGGAGCC
ACCGTCGGTG CCTTGGCCGG AGGCTGGCTG GGTGATCGCT TCGGAAACAA ACTGGTCCTG
GTTATTTTCT TCGCCCTGGC CGTCGTGTCG CTGATCCTGC TGGGAACCCA CCCCGGCCCG
GAGCTCCTCA ACATCCTCCT CTTCATCGCA GGCGCCACCA CGATCGGGAC TCTCGCCGTC
GTCCACGCCT TTGGCGCCGA CTACTACCCG GCCGAAATAC GCTCCACCGG TGTCAGGTGG
TGCTCAGCGA TGGGACGGTT CGGCGCCATC GCAGGGCCGA TCCTGGGCGG AGCACTGATC
GGGCTCAAAC TGCCCCTGGG CCAGAACTTT TTGATCTTCG CAATCCCTGG CGTCATTGCC
ATCGCCGCGG TGCTGCTGGT TGCGCGTACC AAGACCGTCG AGGAATCGCA CGCCGAACCG
CAGCCTGCCG AATCCCAAAC GTCCAGCATC AGCTAG
 
Protein sequence
MTQFTPIKAM ADARLNRFHI ILLLCCSFIM FFDGYDLIVY GSVLPTLMTE WSLTPDQAGW 
LGSAALIGMM IGALTLGSLA DRIGRRPVVV HGTLLFSLAA IATGLATTPE AFGALRFLTG
IFLGGVIPNI VSLMNELAPR ANRHALTTIM LSIYSVGSIV ATLVALWVLP LLGWKPVFFL
SGAALLFLPF LYRWMPESMT FLMSRGKETE ARALLRRAVP TQNHEHVHWT VPAPQHRPSV
SASRLFREGR LLGTPMVWLS FGMCMLMVYG LNTWLPKIMI AGGYDLGSSL QFLIVLNIGA
TVGALAGGWL GDRFGNKLVL VIFFALAVVS LILLGTHPGP ELLNILLFIA GATTIGTLAV
VHAFGADYYP AEIRSTGVRW CSAMGRFGAI AGPILGGALI GLKLPLGQNF LIFAIPGVIA
IAAVLLVART KTVEESHAEP QPAESQTSSI S