Gene Arth_2054 details


Gene Information

Locus tag: Arth_2054
Symbol:
ID: 4445428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2315201
End bp: 2316565
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 67%
IMG OID: 639689862
Product: major facilitator transporter
Protein accession: YP_831534
Protein GI: 116670601
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
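
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return gene_len_bp // 3 - 1


length = gene_length(2315201, 2316565)
print(length, protein_length(length))  # prints: 1365 454
```

Under those assumptions the result matches the Gene Length (1365 bp) and Protein Length (454 aa) fields.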


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCACA CAACATCCGC TGCAGCGCCG CACCGGTCCT CCCCCGGGGA CGCCGCCCCC 
ACTTCCGCTG ATGGGCCTGC ATCGCGGTTC TCCAAAGCGT CGGCCGCGGC CGTCCTCGTT
TGCTGGCTTT TGGTGGTCTT CGATGGCTAC GACCTGATCG TCTACGGCAC TGTCCAGTCT
TCGCTGATCT CCGAGACCGG CTGGGGCCTG AACAAGGCCA CCGCCGGAAC CATCGGTTCC
ATGGCCTTCC TCGGCATGAT GATCGGCGCG ATCTTCGCCG GCCGGATGGC CGACTCGTGG
GGTCGCCGCC GCACCATCCT GGGTTGCGCC GTCGTGTTCT CGATCTTCAC CGCCCTCTGT
GCCTTCGCTC CCAGCGCTGC CGTCTTTGGC GGCCTGCGGC TCCTCGCCGG CATCGGGCTC
GGCGGGCTGG TGCCCTCGGC GAACGCCCTT GTTGCCGAAC TTGTCCCCAC CAAGTGGCGG
TCCACCATCG CCACCCTGAT GATGTCCGGT GTTCCGATCG GCGGATCCAT CGCCGCCCTC
GTCGGCATCC AGCTGATTCC CGCCTTCGGC TGGCAGGCCA TGTTCCTGGT GGCCGTGCTG
GCCCTGGTGA TCGTGGTGCC CCTGGGAGTG AAATACCTCC CGGAGACGCT TGCCCCGGTC
CGGGCCACAG GCAGTGCAAC AGAGAAGGCT GCCCGTACGG CAGCCGGCAA CGCCAGTGCG
GTCAAGGAAC CGTCCGGCTT CTCGTCCCTG CTCCGTGCCC CGTACCTTGG TGTCAGCATG
CTGTTCGCCC TGGCGACGAT CGCCACCCTG TTCGCTTGGT ACGGACTGGG AACCTGGTTG
CCCAACCTGA TGCAGTTGGC GGGCTACAAC CTGGGTTCCG CCCTGACCTT CGCGCTGGCC
CTCAACCTGG GTGCGGTGGC CGGTTCGGTC ATCACGGCCT GGGCCGGAAC CCGCTTCGGT
CCGGTGCCGA CTGCGATCGC CGCTGCCGCC GTCGCCGCCG TTGCGCTGGT GGTCCTCGTC
ACGGGGCCGT CTGTCACCGT CGTCTACCTC ATGCTGGTCC TCGCCGGCGT CGGCACCCAC
GGCACGCAGT GCCTGATCAT TGCCGCCGTC GCGAGCCACT ATCCCGGACA CCTGCGGGGC
ACCGCGCTGG GCTGGGCGCT CGGGACGGGC CGCATCGGTG CCGTCGTCGC GCCGCAGGTG
GGCGGACTCC TGCTGGCAGC CGGACTGGGC GTCAACTCCA ATTTCCTCGC CTTCGCCGGT
GCAGCCGCCA TCGCAGCGGT CCTGCTGGCC GCCGTCGGCC TCAAACTCAA GTCGAAACTC
TCAATTTCAC CCTCACACTC ATCAACAGGA GCAATCAATG TCTGA
 
Protein sequence
MTHTTSAAAP HRSSPGDAAP TSADGPASRF SKASAAAVLV CWLLVVFDGY DLIVYGTVQS 
SLISETGWGL NKATAGTIGS MAFLGMMIGA IFAGRMADSW GRRRTILGCA VVFSIFTALC
AFAPSAAVFG GLRLLAGIGL GGLVPSANAL VAELVPTKWR STIATLMMSG VPIGGSIAAL
VGIQLIPAFG WQAMFLVAVL ALVIVVPLGV KYLPETLAPV RATGSATEKA ARTAAGNASA
VKEPSGFSSL LRAPYLGVSM LFALATIATL FAWYGLGTWL PNLMQLAGYN LGSALTFALA
LNLGAVAGSV ITAWAGTRFG PVPTAIAAAA VAAVALVVLV TGPSVTVVYL MLVLAGVGTH
GTQCLIIAAV ASHYPGHLRG TALGWALGTG RIGAVVAPQV GGLLLAAGLG VNSNFLAFAG
AAAIAAVLLA AVGLKLKSKL SISPSHSSTG AINV
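
Both sequences are printed in space-separated groups of ten across several lines, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups over several lines."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGACTCACA CAACATCCGC TGCAGCGCCG CACCGGTCCT CCCCCGGGGA CGCCGCCCCC"
dna = clean_sequence(first_line)
print(len(dna), gc_fraction(dna))
```

Pasting the whole gene sequence block into `clean_sequence` should give a string of 1365 bases, whose GC fraction can then be compared with the GC content field of the record.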