Gene Cphamn1_0040 details


Gene Information

Locus tag: Cphamn1_0040
Symbol:
ID: 6373683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 42815
End bp: 44347
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 54%
IMG OID: 642682562
Product: sodium:neurotransmitter symporter
Protein accession: YP_001958510
Protein GI: 189499040
COG category: [R] General function prediction only
COG ID: [COG0733] Na+-dependent transporters of the SNF family
TIGRFAM ID:
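
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(42815, 44347)
print(length_bp)                  # 1533
print(protein_length(length_bp))  # 510
```

Both results match the Gene Length and Protein Length fields of the record.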


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCAG AAAGTGCCGG CAGAAGCACC TGGAAATCAA GGAGCGGTTT TATTCTCGCG 
GCAATAGGCT CAGCCGTCGG CTTGGGCAAT ATCTGGAGGT TTTCCTACCT GACCTATGAG
AACGGAGGAG GAGCTTTCCT TATTCCCTAC CTCGTCGCAC TCTTTACGGC AGGCATACCC
CTGCTCATTC TCGAGTTCGG TATCGGGCAT GAACGGATCG GCAGTGCTCC CCTCGCATTC
AGGAAAATCG GAAGGAGATG GGAGTGGCTT GGATGGTGGC CTCTCATGTT TACCATGTTC
GGCATCGTGC TCTACTATGC TGTTGTCATT GCCTGGTGCA TCGACTTCAT CTTCTACTCC
ATCAACCTCT CCTGGGGGGA CGATCCCGAA GCCTTCTTTT TCCAGTCCTT TCTGAACCTC
AGTTCCTCTC CTGCTGAAAT AGGCAGCATC CAGACGCCCA TACTTGCCGG CCTGCTTGCC
GTCTGGCTGA TAACCTGGGG AATCAGCGTC AGGGGCGTAA GCCGGGGCGT GGAGCTGGCC
AACAGGATCT TCATGCCTCT GTTGCTGATC CTGACGCTGA TACTCGTGTT CTGGTCGGTC
ACGCTCGAAG GAGCTATGGT CGGTATTGCC GCCTATCTGC AGCCGGATTT CACAAAGCTG
GCCGATCCGA TGGTCTGGAT CAGCGCCTAC GGCCAGATAT TCTTTACCCT GAGCCTTGGT
TTCGGTATCA TGATCGCCTA TGCAAGCTAC ATGCCGCAAA ACTCCAACAT GACCGGCAGC
GCTTTGATTA CCGCACTGGC CAACAGCGGG TTTTCGCTTC TGTCCGGTTT CGGTGTCTTC
GCGGTTCTGG GTTTCATGTC CGTAAGCAGG AACCAGCCGC TCGACGAGGT GGTCAGGGAA
AGCATCGGTC TGGCCTTTGT TGCGTATCCA AAAGCCATTT CCCTGATCGG CGACGCGGGA
CCTCTTTTCG GGATGCTTTT CTTTCTGAGT CTCACGGTTG CCGGTATCTC CTCATCGATA
TCCATCGTCG AAGCCTTTGT GTCAGGAGCT GAGGACAAAT TCGGCATCAA CCGGAAAAAG
CTGTCGACGG TGCTCTGTTT TCTCGGATTT GTCGGCGGGA TCATCTTTAC GACTAACGGC
GGGCTGTTCT GGCTTGACAT CGTCGATCAC TTCATCAACA ACTACGGACT GGTGGTCGCT
GGCCTGCTCG AATGCATCGT GGTTGGCTGG TTTTTCAATC TCGACATCAT CAGAAAACAC
CTCAACAAGG TGTCGAATAT GCAGCTGGGG ACATGGTGGA ACTGGATCAT CAAACTCTGC
CTGCCCTTGT TCCTTTCCGT GATTCTCCTG AACCAGTTGA TACGGGAGCT CTTTACCGCA
TACGGCGACT ACAGCTGGGC AAGCCTCATA GGCATCGGCT GGAGCTGGAT CGGCATCACC
TTTCTCGTAT CCTTTATCCT CGCCGTCCAG CCCTGGAAAA CTCAACGTCA TCTGGAAAAA
GGAAGGGGCG AGGAACTCCT GCCTCCCGTG TGA
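
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how one might do that and compute a GC percentage; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first line of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_line = "ATGTCATCAG AAAGTGCCGG CAGAAGCACC TGGAAATCAA GGAGCGGTTT TATTCTCGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the complete sequence, `gc_percent` would be expected to land near the 54% reported in the record; that figure is taken from the record and has not been recomputed here.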
 
Protein sequence
MSSESAGRST WKSRSGFILA AIGSAVGLGN IWRFSYLTYE NGGGAFLIPY LVALFTAGIP 
LLILEFGIGH ERIGSAPLAF RKIGRRWEWL GWWPLMFTMF GIVLYYAVVI AWCIDFIFYS
INLSWGDDPE AFFFQSFLNL SSSPAEIGSI QTPILAGLLA VWLITWGISV RGVSRGVELA
NRIFMPLLLI LTLILVFWSV TLEGAMVGIA AYLQPDFTKL ADPMVWISAY GQIFFTLSLG
FGIMIAYASY MPQNSNMTGS ALITALANSG FSLLSGFGVF AVLGFMSVSR NQPLDEVVRE
SIGLAFVAYP KAISLIGDAG PLFGMLFFLS LTVAGISSSI SIVEAFVSGA EDKFGINRKK
LSTVLCFLGF VGGIIFTTNG GLFWLDIVDH FINNYGLVVA GLLECIVVGW FFNLDIIRKH
LNKVSNMQLG TWWNWIIKLC LPLFLSVILL NQLIRELFTA YGDYSWASLI GIGWSWIGIT
FLVSFILAVQ PWKTQRHLEK GRGEELLPPV
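
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds a codon table by hand rather than relying on a library; it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper is illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCATCAG AAAGTGCCGG CAGAAGCACC"))  # MSSESAGRST
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence to `translate` allows the same comparison over all 510 residues.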