Gene Cag_1384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1384 
Symbol 
ID3746568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1848972 
End bp1850738 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content48% 
IMG OID637773920 
Productsodium:solute symporter family protein 
Protein accessionYP_379685 
Protein GI78189347 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAACTC TTACCCTTCT CGACTATAGC TTTATTGGCG GCTACATGCT GCTCACGCTT 
TTTATTGGCT TGTGGTTTTC AAAACGAGCT TCCGAAAATG TAGGAGAATT TTTTCTTTCG
GGGCGACAAC TACCTTGGTG GATTGCAGGC ACCGGCATGG TTGCCACTAC CTTTGCGGCT
GATACTCCGT TAGCGGTTGC AGGTTTTGTA GCAAAACATG GCATTGCCGG CAATTGGGTG
TGGTGGACCT TTGTTTCAGG CGGCATGTTA ACCGTCTTCT TTTTTGCACG CCTCTGGCGA
CGAGCTGAAA TTCTTACCGA TCTTGAATTT ATTGAGCTGC GTTATAGTGG CGCACCTGCT
CGCTTTTTAC GTGGCTTTAA AGCCATTTAC TTCGGGCTTT TTATTAACGC CGTGATTATT
GGCTGGGTTA ACCTTGCCAT GTTTAAAATC ATCCGCATTA TGGTACCCGA ATTGCCACCC
GAAATAACCA TTGTGGCGCT TGTACTCTTC ACCACCTTTT ACTCAGGCTT ATCGGGGCTT
TGGGGCGTTT CGATTACCGA TGCCGTACAG TTTGTAATTG CAATGGTGGG CTGCATTATT
CTTGCCGTAC TTGCCGTTCA ATCGCCTGCC GTAGTGTCGG CTGGAGGATT AACAGGCGCC
TTACCCGCAT GGATGTTCGA CTTTTTCCCC AACTTTAGCC ATAGTGCGGA GGAGAGTAAC
AGCGTTACCG GAGCGATGTC GCTACCGTTG CTCTCGTTTG TAGCAATGGC GTTTGTGCAG
TGGTGGGCAT CGTGGTACCC GGGCGCTGAA CCCGGTGGCG GCGGCTACAT TGCCCAGCGC
ATGATGAGCG CTAAAGATGA AAAGCATTCG CTGCTTGCAA CGCTGTGGTT TACCGTAGCT
CACTACTGCT TGCGCCCTTG GCCTTGGATA CTTGTAGGGC TTGCAAGCTT AGTCATGTTC
CCCAACCTTC CAGCCAATCA AAAAGAGGAT GGCTTTGTGT ATGTTATGCA AGCCGTCTTA
CCGCCCGGCT TAAAAGGATT GCTGATTGCC GCTTTTCTTG CCGCATACAT GTCAACCCTT
TCAACGCACC TCAATTGGGG CACAAGCTAC CTTATTAACG ACTTTTACCA ACGTTTTGTA
AAACGCGATG GCACCCCGCA GCACTACGTA CTTGCATCAA AAATCACCAC CTTTTTAACC
GCAGCCTTTG CACTCTACAT CACCTTTTTT GTGCTCGAAA CCATTACAGG CGCATGGGAA
TTTATTATTC AATGCGGCGC AGGCACAGGC TTTGTGCTTA TTATGCGCTG GTTCTGGTGG
CGCTTAAACG CATGGTCAGA AATTACCGCA ATGGTGGCTC CCTTTATTGC CTTTACGCTG
CTCCAGCAAT TCACCACCAT AACCTTTCCA ATCTCCCTTT TTATTATTGT GGGCGTAACA
ATTACCGCCA CACTTGTAGT AACGTTTGCA ACCAAACCAA CCGAGCCAGC ACAACTTGAA
ACCTTTTACC GCACCACTCG TGTTGGCGGC AGGCTATGGA AAAAAGTATC GGATACTTTG
CCTGACGTGC AATCAGATAG TGGATTTGGT ATGTTGCTTG TTGATTGGGC ACTTGGCGTA
GTGATGGTTT ACACCATTTT GTTTGGCACA GGGCGTGTAA TTTTTGGAGA AATAGGAACG
GGTATTCTCT TTCTTGCCAT CGGCGCTATA GCAGGAACGC TTATTTTTGT TGATCTTAAC
CGACGTGGAT GGAATAATTT GCAGTAA
 
Protein sequence
MPTLTLLDYS FIGGYMLLTL FIGLWFSKRA SENVGEFFLS GRQLPWWIAG TGMVATTFAA 
DTPLAVAGFV AKHGIAGNWV WWTFVSGGML TVFFFARLWR RAEILTDLEF IELRYSGAPA
RFLRGFKAIY FGLFINAVII GWVNLAMFKI IRIMVPELPP EITIVALVLF TTFYSGLSGL
WGVSITDAVQ FVIAMVGCII LAVLAVQSPA VVSAGGLTGA LPAWMFDFFP NFSHSAEESN
SVTGAMSLPL LSFVAMAFVQ WWASWYPGAE PGGGGYIAQR MMSAKDEKHS LLATLWFTVA
HYCLRPWPWI LVGLASLVMF PNLPANQKED GFVYVMQAVL PPGLKGLLIA AFLAAYMSTL
STHLNWGTSY LINDFYQRFV KRDGTPQHYV LASKITTFLT AAFALYITFF VLETITGAWE
FIIQCGAGTG FVLIMRWFWW RLNAWSEITA MVAPFIAFTL LQQFTTITFP ISLFIIVGVT
ITATLVVTFA TKPTEPAQLE TFYRTTRVGG RLWKKVSDTL PDVQSDSGFG MLLVDWALGV
VMVYTILFGT GRVIFGEIGT GILFLAIGAI AGTLIFVDLN RRGWNNLQ