Gene Arth_0134 details

Gene Information

Locus tag: Arth_0134
Symbol:
ID: 4447397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 135687
End bp: 137156
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 66%
IMG OID: 639687929
Product: amino acid/peptide transporter
Protein accession: YP_829635
Protein GI: 116668702
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID: [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial
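
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Both are assumptions, though they agree with the numbers in this record. The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 135687
end_bp = 137156

gene_length = end_bp - start_bp + 1   # both endpoints are counted
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length)       # 1470
print(gene_length % 3)   # 0, i.e. a whole number of codons
print(protein_length)    # 489
```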


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACAA CTCATTTATC CGACCGCCCC GCTACAACGC CGGGCGATAC GTCATTTTTT 
GGCCACCCAA AGATGCTGGC CAGCCTCTTC TCCGTAGAAA TGTGGGAGCG TTTCTCCTTC
TACGGGATGC AGGGCATCCT CCTCTATTAC ATGTACTTCA CGGCCGCGCA GGGCGGCCTC
GAAATCGAGC AGGGCCTGGC CGCCGGCCTG GTGGGCGCCT ACGGCGGCGG AGTCTATCTC
TCCACCATTC TGGGCGCGTG GCTGGCCGAC CGGCTCTTCG GTTCGGAACG GGTCCTGTTC
GGCTCGGCCG TCCTGATCAT GGCGGGCCAC ATCGCCTTGG CGCTTGTCCC GGGCATTCCC
GGACTCATCG CCGGCCTGGT GCTGGTGGGC GTCGGCTCGG GCGGCCTCAA GGCAAATGCC
ACCGCGCTGG TGGGCACCCT GTACGGGGAG AAGGACGAAC GCCGCGACGC AGGCTTCTCC
ATCTTCTACA TGGGCATCAA CGCCGGCGCG CTGATCGGCC CGCTGGTTAC CGGCTGGCTG
CAGGAAAGCC GGGGGTTCCA CTGGGGCTTC GGCGCCGCCG CCGTCGGAAT GGCCCTTGGC
CTGGGCATCT ACGCGATGGG ACGCGGGAAG CTCCCCGAAG CGGCGCACCA CGTGCCCAAC
CCGCTCCCTG CGGCCCAACG CACCAGATAC GGGCTGATCT TCCTGGGCAT CGCCGCCGTG
GTGGCGGTTC TCCTGGCCAC CGGCGCCGTG AACGCCGAGA ACCTGGCCAT GTCCATGGCG
TACGCGGCCA TCGGCGCTTC CGTGCTCTAC TTCGGCCTGA TCTTCTCCAG CAGGAAGGTG
ACGGGCGTCG AGCGCAAGCG CGTGGTGGCC TTCATCCCGC TGTACATCGC CTCCGCGGCG
TTCTGGGCAC TGTTCCAGCA GCAGTTCACG TTCATTGCCG TGTACTCAGA GGAGAAGCTG
GACCGGAACC TCTTCGGCTG GGAGATGCCT GCCGCCTGGG TGCAGTCGAT CAACCCGGTG
TTCATCATCA TCTTTGCCGG CGTCATGGCG GCCCTGTGGA CCCGGATGGG CAACAAGCAA
CCCGGATCGG CCCTGAAGTT CTCCATCGGC CTGTTCGTGA TGGGCCTGGC CTTCCTGGCC
TTCATTCCGC TGGCCGGCAG CGGCAAGACG CCCCTCCTGG CACTGGTGGG CATCCTGTTC
CTGTTCACCC TGGCGGAGCT TTTCCTCTCC CCCATCGGGC TGTCCGTCAC CACCAAACTG
GCACCCCAGG CCTTCCATAC GCAGATGGTG GCCCTGTTCT TCCTGTCCGT TTCGCTCGGC
ACCACCCTGG CCGGCATACT GTCCGGGCTG TATAACCCGG ACGACGAACT GCCGTACTTT
ACCGGTATCG GCGGCACGGC CATGGTGCTG GCCGTCGGCC TCGCTGCCGC CTCGCCGGCC
ATCAAGAAGC TGATGGCCGG CGTACGCTGA
 
Protein sequence
MSTTHLSDRP ATTPGDTSFF GHPKMLASLF SVEMWERFSF YGMQGILLYY MYFTAAQGGL 
EIEQGLAAGL VGAYGGGVYL STILGAWLAD RLFGSERVLF GSAVLIMAGH IALALVPGIP
GLIAGLVLVG VGSGGLKANA TALVGTLYGE KDERRDAGFS IFYMGINAGA LIGPLVTGWL
QESRGFHWGF GAAAVGMALG LGIYAMGRGK LPEAAHHVPN PLPAAQRTRY GLIFLGIAAV
VAVLLATGAV NAENLAMSMA YAAIGASVLY FGLIFSSRKV TGVERKRVVA FIPLYIASAA
FWALFQQQFT FIAVYSEEKL DRNLFGWEMP AAWVQSINPV FIIIFAGVMA ALWTRMGNKQ
PGSALKFSIG LFVMGLAFLA FIPLAGSGKT PLLALVGILF LFTLAELFLS PIGLSVTTKL
APQAFHTQMV ALFFLSVSLG TTLAGILSGL YNPDDELPYF TGIGGTAMVL AVGLAAASPA
IKKLMAGVR
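
The gene and protein sequences are printed in space-separated blocks of ten. As a rough sketch, the helpers below strip that layout, compute a GC fraction, and translate codons. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard genetic code, whose codon-to-amino-acid assignments are shared by translation table 11 (table 11 differs in which codons may act as starts). Only the first line of the gene sequence is used here, to keep the example short.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3   # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First line of the gene sequence above (60 nt, 20 codons).
first_line = "ATGAGCACAA CTCATTTATC CGACCGCCCC GCTACAACGC CGGGCGATAC GTCATTTTTT"

print(translate(first_line))   # MSTTHLSDRPATTPGDTSFF
print(round(gc_fraction(first_line), 2))
```

The translated fragment matches the first two blocks of the protein sequence above (MSTTHLSDRP ATTPGDTSFF). Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.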