Gene Anae109_1759 details


Gene Information

Locus tag: Anae109_1759
Symbol:
ID: 5375360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 1980196
End bp: 1981791
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 72%
IMG OID: 640843267
Product: Na+ symporter
Protein accession: YP_001378946
Protein GI: 153004621
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
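
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Anae109_1759.
gene_length = span_length(1980196, 1981791)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1596 531
```

Both results agree with the Gene Length and Protein Length fields listed in the record.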


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCA TGACTTCGTG GGTCTACGTG GGGATCGTCG TCGCGTACGT GGCCGTCATG 
ATCGGCGTGG GCTGGCTCGC CATGCGCCGG ACCCGGGACG TCCACGACTT CTTCATCGGG
GGACGCAGCC TCGGGCCGTG GATGAGCGCG TTCGCCTACG GCACGACGTA CTTCTCGGCC
GTCCTCTTCA TCGGCTACGC GGGGAAGCTG GGCTGGGCGT TCGGGATCCA CACGCTCTGG
ATCGTCCTCG GCAACACGGT GGTCGGGACC ATCCTCGCCT GGAAGGTGCT CGCCGGGCGC
ACCCGCGAGA TGACGGCGCG CCTCGACGCC ATCACCATGC CGCAGTTCCT CGCCGCCCGG
TACGGCTGTC GCGGACTCCA GATCGCGGCC GCGCTGGTGG TGTTCGTGTT CCTCGTCCCG
TACTCCGCGT CCGTGCTCAT GGGGCTCTCG TACCTGTTCG AGATGACCCT CCACATCCGC
TACGAGACCG CGCTCTACCT CCTCACCGCC ATCACCGCCG TGTACCTGGT GATGGGCGGC
TACTTCGCGG TCGCGGTGAG CGACTTCGTG CGCGGCATCG TGGAGTTCGC GGGCGTGATG
GCGATGGTGT GGCTGCTCGC GCACCGGCCG GAGGCGGGCG GCTTCTTGGA GGCGACGCGG
CGGCTGCTGG GCGACCCCGC CACCATGGCG CCGGGGCTCG TGGCGGTGAA GCAGGTGGGG
GCGGGCACGC CCCTCGGCGT CGCGGTGCCG GGGTGGCTCA CGCTCGCGGC GCTGGTCCTC
ATCACCAGCC TCGGGCCGTG GGCCCTCCCG CAGATGGTCC AGAAGTTCTA CTCGATCCGC
TCCCGCGCCG ACGTGACGCG CGCGCTCGTC ATCGCCGGAG TCTTCGCGCT CTTCATGGCG
TTCGGCGCGT ACTACAGCGG CGCCCTCACC CACCTCCGCT ACGGCGCGCG CCTCCCGCCC
GAGCTCGTCG GTCCGTCCGG CCCGATCTGG GACAAGATCA TGCCGCACTT CATCACGACG
TCCGGGCTGC CCGAGGCGCT CGTGCTCGTG ATCGTGCTGA TGGTCTTCTC CGCGTCGATG
TCGAGCCTCT CGTCCCTCGT GCTCGTCTCG TCCTCCGCCC TCGGCATCGA CCTGTACGGC
GCGCTCGCGG GCCAGGGCCG GACGCCGCGG CACCAGATGG CGGTCCTCCG CGCGCTCTGC
GCGGTGTTCG TCGGCCTCTC GCTGGTGATC GCGCTCGCGA AGCCCGCGGT GATCGTGAAC
CTCATGGTGA TGAGCTGGGG CACGCTCTCG GCGGTGTTCC TCGCGCCGTT CGTGTACGGG
CTCTTCTGGC GGCGCGCGAA CCGCGCCGGC GCGTGGGCGG CCATCGTCGC GGGGCTCGCC
ACGGCCCTCG TGCTCTTCCC GGCGTGGGGC GGAGACGGCG TCCCGCTCGC CGGCGCGATC
GCGACGCTGC TGCCGCTCGC CGTCCTCCCC GCGGTGAGCC TCCTCACCGG CGCGCCCGAG
GCGTCGCGCG TCGCCCGTGC GTTCGGCGAC GCCGACCCCG CGGCGGCCGC GGCCAGCGAT
CGCGACGCGG AGGGCGGACG GCGGGTGGCG GGGTGA
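
The GC content field can be recomputed from the gene sequence once the 10-base grouping and line breaks are removed. The following is a minimal sketch, assuming the sequence contains only the letters A, C, G and T. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base groups of the gene sequence, as a short illustration.
sample = clean_sequence("ATGTCCGCCA TGACTTCGTG")
print(len(sample), gc_fraction(sample))  # 20 0.55
```

Pasting the full gene sequence into `clean_sequence` would yield the value to compare against the GC content field of the record.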
 
Protein sequence
MSAMTSWVYV GIVVAYVAVM IGVGWLAMRR TRDVHDFFIG GRSLGPWMSA FAYGTTYFSA 
VLFIGYAGKL GWAFGIHTLW IVLGNTVVGT ILAWKVLAGR TREMTARLDA ITMPQFLAAR
YGCRGLQIAA ALVVFVFLVP YSASVLMGLS YLFEMTLHIR YETALYLLTA ITAVYLVMGG
YFAVAVSDFV RGIVEFAGVM AMVWLLAHRP EAGGFLEATR RLLGDPATMA PGLVAVKQVG
AGTPLGVAVP GWLTLAALVL ITSLGPWALP QMVQKFYSIR SRADVTRALV IAGVFALFMA
FGAYYSGALT HLRYGARLPP ELVGPSGPIW DKIMPHFITT SGLPEALVLV IVLMVFSASM
SSLSSLVLVS SSALGIDLYG ALAGQGRTPR HQMAVLRALC AVFVGLSLVI ALAKPAVIVN
LMVMSWGTLS AVFLAPFVYG LFWRRANRAG AWAAIVAGLA TALVLFPAWG GDGVPLAGAI
ATLLPLAVLP AVSLLTGAPE ASRVARAFGD ADPAAAAASD RDAEGGRRVA G
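
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table from its compact NCBI form and translates a grouped DNA string until the first stop codon. It is an illustration rather than the pipeline that produced this record: it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and the function name is hypothetical.

```python
from itertools import product

# Amino acids for translation table 11, with codons ordered by base T, C, A, G
# at each position. The amino acid assignments match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence.
print(translate("ATGTCCGCCA TGACTTCGTG GGTCTACGTG"))  # MSAMTSWVYV
```

The output matches the first ten residues of the protein sequence listed above.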