Gene Anae109_3836 details

Gene Information

Locus tag: Anae109_3836
Symbol:
ID: 5375462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4470202
End bp: 4471821
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 68%
IMG OID: 640845361
Product: SSS family solute/sodium (Na+) symporter
Protein accession: YP_001380999
Protein GI: 153006674
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
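
The coordinates and lengths in this record are consistent with each other, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based inclusive coordinates and a single stop codon that is not counted in the protein length; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4470202, 4471821)
print(length)                     # 1620
print(protein_length_aa(length))  # 539
```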


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.808948
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCGA TCGACGCGAT CCAGGCCGTC CTCCTCACGC AGGCCGCCAC CGCCCCCGCG 
CCGGCAGCCG AGCAGATGCA GTCCTCGCTC GGCTCCCCGT CGCTGATGTC CATCCTCTTC
TTCTTCATCA TCGTCGCGGT GACGCTCGTC ATCACGTACT GGGCGGCCCG GAAGACGAAG
ACCTCCTCCG AGTTCTACGC GGCCGGCCGC AGCGTCTCGG CGCTCCAGAA CGGGTTCGCG
CTCGCAGGCG ACTACATGTC CGCGGCGTCG TTCCTCGGCA TCTCCGGCAT GGTCGCCCTC
AAGGGCTACG ACGGCATGAT CTACGCGACC GGCTGGCTCG TCGGCTGGCC CGCGCTCATG
TTCCTCGTCG CCGAGCCGCT GCGGAACCTC GGCAAGTTCA CCTTCGCCGA CGTGGTCGCC
TTCCGGCTGC GGCAGAAGCC GGTCCGCATC GCCGCCGCGA TCGGCGGCAT CCTGACGGTG
CTCTTCTACA CGATCGCGCA GATGGTCGGG TCGGGCGCCC TCATCCAGCT CATGTTCGGC
CTCAAGTACG AGTACGCCGA GCTGATCGTC GGCGTGGTGA TGCTCGCCTA CGTGCTCTTC
GGCGGCATGC TCGCCACCAC CTGGGTGCAG ATCATCAAGG CCGGCCTCCT CCTGTTCGGC
GCGAGCCTCC TCACGGTCCT CGTGCTGGCG AAGTTCGGCT TCAACCCGGG CAACCTCTAC
TCCGCCGTGG TCGCGAAGTA CGGCCAGGTG GCGCTCGAGC CGGGCGGCAT CGTCGCGAGC
CCGCTCGAGG CGGTCTCGCT CGGCCTCGCC CTCATGTTCG GCCTCCTCGG GCTGCCGCAC
ATCCTGATGC GCTTCTACAC GGTGCCCGAC GCCAAGGCGG CCCGGAAGTC GGTGCTCTAC
GCCACCGGCC TCATCGGCTA CTTCTACGTC ATCATCCCCA TCGTGGGCTT CGGCGCGGCG
GTGCTCCTGC CCGGCGGCCG CAGCACCATC ACCGGGTTCG ACGCCGGCGG CAACATGACC
GCCCCGCTCC TCGCCGAGCT CCTCGGCGGC ACCGCGTTCC TCGGCTTCAT CGCGGCGGTC
GCGTTCGCCA CCATCCTCGC CGTGGTCGCC GGCCTCACCC TCGCCGGCGC GTCGGCCTAC
TCGCACGACA TCTACGTGAA CGTCATCAAG GGCGGGAAGG CGACGGAGGA GGAGCAGGTC
AAGGCCGCCA AGACCGCGAC CATCATCTTC GGCGTGGTGG CGGTCGGCCT GGGCATCCTC
TTCAAGGGCC AGAACGTCGC GTTCATGGTC GGCCTCGCGT TCGCGATCGC CGCCAGCGCC
AACTTCCCGG CGCTCCTCAT GTCGATCGTG TGGAAGCGCT TCACGACGCA GGGCGCGGTC
TGGTCGATCC TCGTCGGCGC GTTCAGCTCC TGCGCGATGA TCGTGCTCTC CAAGACGGTC
TGGGTGGACG TCTTCGGCTT CGCGCACGCC ATCTTCCCGA TGAAGAACCC GGCCATCTTC
TCGATGACGG GCGCCTTCGC GGTCGGCATC CTGGTCTCGC TGCTCACCCC CGAGCGCGAG
GCGCAGGAGA AGTTCGAGGA CGAGAAGCTC CGCACGTACC TCGGCGTCGG CGCCGAGTAG
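
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases; the helper name is hypothetical, and the example call uses only the first 10-base block of the gene rather than the full sequence.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 2 G + 3 C out of 10 bases.
print(gc_content_percent("ATGACCTCGA"))  # 50.0
```

Passing the whole gene sequence (whitespace included) to the same function gives the figure to compare against the GC content reported for the gene.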
 
Protein sequence
MTSIDAIQAV LLTQAATAPA PAAEQMQSSL GSPSLMSILF FFIIVAVTLV ITYWAARKTK 
TSSEFYAAGR SVSALQNGFA LAGDYMSAAS FLGISGMVAL KGYDGMIYAT GWLVGWPALM
FLVAEPLRNL GKFTFADVVA FRLRQKPVRI AAAIGGILTV LFYTIAQMVG SGALIQLMFG
LKYEYAELIV GVVMLAYVLF GGMLATTWVQ IIKAGLLLFG ASLLTVLVLA KFGFNPGNLY
SAVVAKYGQV ALEPGGIVAS PLEAVSLGLA LMFGLLGLPH ILMRFYTVPD AKAARKSVLY
ATGLIGYFYV IIPIVGFGAA VLLPGGRSTI TGFDAGGNMT APLLAELLGG TAFLGFIAAV
AFATILAVVA GLTLAGASAY SHDIYVNVIK GGKATEEEQV KAAKTATIIF GVVAVGLGIL
FKGQNVAFMV GLAFAIAASA NFPALLMSIV WKRFTTQGAV WSILVGAFSS CAMIVLSKTV
WVDVFGFAHA IFPMKNPAIF SMTGAFAVGI LVSLLTPERE AQEKFEDEKL RTYLGVGAE
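
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand, on the assumption that for internal codons translation table 11 assigns the same amino acids as the standard genetic code (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order; amino acids follow the standard assignment.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACCTCGA TCGACGCGAT CCAGGCCGTC"))  # MTSIDAIQAV
```

The output matches the first ten residues of the protein sequence above, and the final codons of the gene (GGC GCC GAG TAG) translate to GAE followed by a stop, matching the end of the protein.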