Gene Cag_0195 details

Gene Information

Locus tag: Cag_0195
Symbol:
ID: 3746682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 222055
End bp: 223461
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 46%
IMG OID: 637772722
Product: sodium:solute symporter family protein
Protein accession: YP_378516
Protein GI: 78188178
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
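
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the last codon is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(222055, 223461)
print(length)                     # 1407
print(protein_length_aa(length))  # 468
```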


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCCAC TTGATACGGC ATTAGTGCTC CTTTTTCTTG TTGCCAACAT TGCATTTGGA 
TTGTGGCAAA GTAAGTCCAA CAAAAGCACA GGCGATTACT TTCTTGGTGG GCATAGCGTA
CCGTGGATTG TAGCAATGCT CTCCATTGTT GCTACCGAAA CCTCCGTGCT TACCTTTGTA
AGCGTACCCG GCTTGGCGTA CCGTGGCGAT TGGAGTTTTT TGCAATTGCC GTTGGGCTAC
ATTGTTGGGC GTGTGTTAGT TAGCATGTTT CTTTTGCCAC TCTACTTTCG TGAAGGGGTA
AGCTCTATTT ATGAAATTAT TGGGCGACGT TTTGGCACAG GAATGCAAAA GTTAGCCTCC
GTAGCATTTC TTATAACGCG CATTTTGGGC GATGGTGTAC GTTTTTTAGC AACGGGCGTA
GTGGTGCAAG CGGTAACGGG GTGGTCGCTG CCTCTCTCTA TTGTGCTTAT TGGCGTGGTT
ACGCTTATTT ACACCATTTC AGGTGGCTTA AAAAGTGTGG TATGGCTCGA CAGCTTTCAA
TTTGGCTTGT ACTTTCTCGG TGGCGTTATT TCCATTAGTT ACCTTTTGCA GCAGCTCGAT
GCCCCATTTC CCACGCTCTT TGCAACTCTT CATGAGGCAG GCAAGCTACA AGTGTTTCAA
TTCAGCAACG ACTTGCTTGT TAACCCTATG GCATTTGGAG CGGCATTTCT CGGCGGCGTT
TTTCTCTCTT TTGCCTCACA TGGCGTGGAC TTTATGATGG TGCAGCGCGT GCTGGGCTGT
CGTTCGTTAA GCAACGCTCG CAAAGCTATG ATTGCAAGTG GCTTTTTTGT CTTCTTTCAG
TTTGCCATTT TCTTATTAGC TGGCTCGTTA ATGTTTCTTT TTATGGAAGG AAGGGAAGTA
GAGAAAGATC GCGAGTTTGC CTTCTTTATT GTTCACCATC TGCCAACCGG TTTAAAGGGA
ATTTTATTAG CGGGAATTCT TTCGGCTGCC ATGTCAACCA TTGCCTCTTC AATTAATTCT
TTAGCCGCTT CTACGGTTAC CGATATGGCT GGTGGCAAGG TATCGCTTAC GGGATCGCGC
TGGATTAGCT TTGGTTGGTC ATTAGTGCTT ATTGCCATTG CGCTGCTTTT CGACGAAAGC
AATAAGGCAA TTATTATGGT TGGCTTAGAA ATTGCATCGT TTACCTATGG TGGATTGCTC
AGCTTGTTTT TGCTTTCTCG AAGCTCTCGC GCCTTTCATC CCGTAAGCCT TGCGGTTGGA
TTTCTGGCAA GTATGGCAGT AGTGGTTTTG CTCAAATATG TTGGGCTTGC ATGGAGCTGG
TATATTTTAC TCTCCGTGCT GCTGAATGTG CTCTTAGTGT ATGGCATTGA TATTGTTACC
AATACCATAT CACCAAAGCG TTTGTAA
 
Protein sequence
MQPLDTALVL LFLVANIAFG LWQSKSNKST GDYFLGGHSV PWIVAMLSIV ATETSVLTFV 
SVPGLAYRGD WSFLQLPLGY IVGRVLVSMF LLPLYFREGV SSIYEIIGRR FGTGMQKLAS
VAFLITRILG DGVRFLATGV VVQAVTGWSL PLSIVLIGVV TLIYTISGGL KSVVWLDSFQ
FGLYFLGGVI SISYLLQQLD APFPTLFATL HEAGKLQVFQ FSNDLLVNPM AFGAAFLGGV
FLSFASHGVD FMMVQRVLGC RSLSNARKAM IASGFFVFFQ FAIFLLAGSL MFLFMEGREV
EKDREFAFFI VHHLPTGLKG ILLAGILSAA MSTIASSINS LAASTVTDMA GGKVSLTGSR
WISFGWSLVL IAIALLFDES NKAIIMVGLE IASFTYGGLL SLFLLSRSSR AFHPVSLAVG
FLASMAVVVL LKYVGLAWSW YILLSVLLNV LLVYGIDIVT NTISPKRL
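
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the pipeline that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names clean, translate and gc_fraction are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "ATGCAGCCAC TTGATACGGC ATTAGTGCTC CTTTTTCTTG TTGCCAACAT TGCATTTGGA"
print(translate(first_line))  # MQPLDTALVLLFLVANIAFG
```

Passing the full gene sequence to translate and gc_fraction allows the listed protein sequence and GC content to be compared with the record in the same way.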