Gene Arth_1720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1720 
Symbol 
ID4445759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1921141 
End bp1922565 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content65% 
IMG OID639689542 
Productmajor facilitator transporter 
Protein accessionYP_831214 
Protein GI116670281 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTGAAA CTCCCCATAT AACGCCGGGC TCTGCGACCT CCCGCGATCC CCGGACCTGG 
CCCCGGGCCA AGCGCAACCG CTACGGCTGG TTCATGACGT TCAGCCTGCT TTTCCTGATG
ATGCTCAGCT GGGCGGACAA AGCAGTCCTG GGCATTGCCG CCGTCCCCCT GATGAAACAG
CTGGGCATCA CGCCCGAACA GTTTGGCCTG GTGGGCAGTG CCATGTTCCT GACCTTCGGC
GTTGCCCAGA TCGTTGCTGC CCCCATCGCC AACAAGGTAT CGAGCAAGTG GATCCTGCTG
GTCCTGTGCC TGCTGTGGTC AGTGGCGCAG GTTCCAATCC TGCTGTTCGC TTCCCTCCCC
GCGCTTTGGG CAAGCCGCTT GCTGCTCGGG GCCGGCGAGG GCCCGCTGGC GCCCGTCCTG
ATGCACGGCA TCTACAAGTG GTTCCCCGCA AAGAAGGGCG CCACCCCTGC CGCGCTGGCA
TCCTCCGGCG TGACCTTGGG CATTGTGGCG TTCGCTCCCG TTTTGGCCTG GGTCATCGGT
CAATTCGGCT GGCAGACCGC CTTCGCCGTG CTGGCCATTG TCGGACTGGT TTGGTCCATC
TTCTGGTTCA TCGTGGGCAA GGAAGGCCCG TACACGAGCC GGAAGGCTGA ACAGGAACTT
GACGGCATTG CCCCGGAAGA GGCACCTGTC GTGGCCGAAG CCAAGGTCCG CTACTGGCGC
ACCATCCTGT CCCCCAGCTG GATCTTCTCA GTCCTGGCCT CCTTCTTCGG CTACTGGACC
TTCACCCTCG CCATGTCATG GGGCCCGGCC TATTTCCAAA ACGTCCTCGG CTTCAGCGGA
CAACAGGCCG GCACCATGAT CGCCCTGCCC GCCGCCTGGG GAACCATTGC CACTGTCGGC
CTCAGTGCAC TCACCCAGCG CCTCCACCTC AAGGGCGTTC CCACCAGAAA GGCACGCGGC
TGGGTACTCG GCAGTGCCGG CGCCTTCGCC GGAGCGTGCC TGGTAGGGGC GACCATGACC
ACGTCCCCCG TCCTGTCCAT CGCCTTGATG GTCTTCGGCT TTGGCACCGC ACCCGCGCTC
TTTGCCATCA CCTACCTGGT GGTGGCCGAG CTGACCACCA TCGGCCAGCG CGGCGCTAAC
CTCTCCATCG CCAACGCCGT CCTCACCACC GGAGGTGTGT TCGCACCCGC AGTGTCCGGA
TTCCTGATCG GCGGTGCGGC CACCCCGGCA GATGGTTACC GTGCCGCGTT CGCCCTGGCC
GGAGGGCTGA TGCTGACGTT CGGCGCCCTG GCCCTGGTGT TCGTCAACCA GCAGCGTGAC
CGCCGCAGGC TTGGCCTTGA CGTAACCGCA GGCTTCCCGC TCGAGGCATC CACCCCCGCT
GCCGGATCCG AGACGGCTGC AATAGCGGCA GTCACTAAGG CCTAA
 
Protein sequence
MSETPHITPG SATSRDPRTW PRAKRNRYGW FMTFSLLFLM MLSWADKAVL GIAAVPLMKQ 
LGITPEQFGL VGSAMFLTFG VAQIVAAPIA NKVSSKWILL VLCLLWSVAQ VPILLFASLP
ALWASRLLLG AGEGPLAPVL MHGIYKWFPA KKGATPAALA SSGVTLGIVA FAPVLAWVIG
QFGWQTAFAV LAIVGLVWSI FWFIVGKEGP YTSRKAEQEL DGIAPEEAPV VAEAKVRYWR
TILSPSWIFS VLASFFGYWT FTLAMSWGPA YFQNVLGFSG QQAGTMIALP AAWGTIATVG
LSALTQRLHL KGVPTRKARG WVLGSAGAFA GACLVGATMT TSPVLSIALM VFGFGTAPAL
FAITYLVVAE LTTIGQRGAN LSIANAVLTT GGVFAPAVSG FLIGGAATPA DGYRAAFALA
GGLMLTFGAL ALVFVNQQRD RRRLGLDVTA GFPLEASTPA AGSETAAIAA VTKA