Gene Cphamn1_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1650 
Symbol 
ID6375336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1781209 
End bp1782660 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content50% 
IMG OID642684144 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_001960050 
Protein GI189500580 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.888495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.420592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTGGA TTGACTACCT GATTTTCGGA GCTTATATGT CTGGAGTTCT GGCGATCGGA 
TTCTACTATT TCAGGAAAAA CTCCAATGAA GAAGATTATT TCGTGGGCAG CCGTGACATC
AACCCATACC ATGTCGGGCT CTCCATTGTA GCTACCGATG TCGGCGGCGG TTTTTCCATA
GGTCTTGGAG GAGTCGGTTT TCTTATGGGA CTCTCCGGGA GCTGGCTGTT GTTCACCGGG
CTCATCGGGG CATGGCTTTC AGCCGTATTT ATCATTCCGA AAATCAAGAA AATCGATCGC
GAAAAGGGAT TTATGACCTT TCCCGATTTT CTCAGGGACC GTTACGATAC AAGGGTAGCA
TTCATGGCTG CGCTGATTTC AGGGCTCGGC TATATGGGCT TCACCGGAGC ACAGATGCTG
GCGGGAGCGA AACTGGCTTC CGCCACGATA CTGCAAAACA ACCCCTTCGG TATGGAGCCG
GTGCTGTTTT CACTGTTGAT CATTGCGGTC GTTACTATTC TCTATACCGT TACAGGAGGC
CTGAAAGCGG TGATTTATAC CGATACGCTG CAGTGGATCG TCCTCCTGGC CGGCCTCATT
TTCATTGCAA TCCCCATAAC CCTCGGAAAG ATCGGTGGCT TCGAGGCCCT AAGAACAAAC
CTGCCGCCCG ACCATTTTTC ACTCACAGCC ATCAAGCCGA CAACGTTTAT CAACTGGATG
GTGACCATCA TCCCAATCTG GATAATCGGT ATGACGCTCT ATCAGAGAAT GTATGCCTGC
CGTAATGAAA AAGACGCAAA AAAAGCCTGG TACACCGCAG GACTGTTTGA ATATCCCGTC
ATGGCATTTT CAGGAGTTTT CCTTGGCATG TGCGCCAGGG TGGTCTTTCC CGAAGCCGAA
CCGGAAATGG CGCTTCCGAT GCTTGTACGG GACATGCTTC CTGCCGGAAT AACGGGCATC
ATCATCGCCG CATATTTTTC AGCGATCATG TCCACGGCCG ACAGTTGCAT GATGGCGTCA
TCAGGAAACT TCACCAGTGA TATCATCAGA CCTCTCATCC AGAAAAAAAG ACCTGGAAAA
GTCAACACAA TTCAGCTTTC CATGATCGTC ACCTTTGCGG TAGGCGCACT CGCGGTCATT
CTGGCTGCAC GTTTCACGTC TGTGCTCAAC GCGATTCTTT ACACCTATTC TTTCATGGTT
TCCGGATTGT TCGTCCCGAC GCTGGGAGCG TTTTTCTGGA AAAAAGGGTC CTCAATGGGA
GCCCTGGCAG GAATGGCCGG AGGAGGAACG CTCACATTGT TGGTCATCAG CGGGGTGTTT
GCACTTCCGA AGCAACTCAA AATGCTCGAA CTGGACGCCA CGATTTACGG CATCATCGTC
TCGGCTCTGC TCTTTTTCAG CGTATCACTT CTCTTTCCCG ATCCGGAAAA AAACAAACAG
ACAAAACAAT AG
 
Protein sequence
MSWIDYLIFG AYMSGVLAIG FYYFRKNSNE EDYFVGSRDI NPYHVGLSIV ATDVGGGFSI 
GLGGVGFLMG LSGSWLLFTG LIGAWLSAVF IIPKIKKIDR EKGFMTFPDF LRDRYDTRVA
FMAALISGLG YMGFTGAQML AGAKLASATI LQNNPFGMEP VLFSLLIIAV VTILYTVTGG
LKAVIYTDTL QWIVLLAGLI FIAIPITLGK IGGFEALRTN LPPDHFSLTA IKPTTFINWM
VTIIPIWIIG MTLYQRMYAC RNEKDAKKAW YTAGLFEYPV MAFSGVFLGM CARVVFPEAE
PEMALPMLVR DMLPAGITGI IIAAYFSAIM STADSCMMAS SGNFTSDIIR PLIQKKRPGK
VNTIQLSMIV TFAVGALAVI LAARFTSVLN AILYTYSFMV SGLFVPTLGA FFWKKGSSMG
ALAGMAGGGT LTLLVISGVF ALPKQLKMLE LDATIYGIIV SALLFFSVSL LFPDPEKNKQ
TKQ