Gene Cpin_3541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3541 
Symbol 
ID8359708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4412214 
End bp4414007 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content47% 
IMG OID644965712 
ProductNa+/solute symporter 
Protein accessionYP_003123206 
Protein GI256422553 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.165749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.131888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAT CAGCGATAGA TACGACGGTT ATATTCATTT TTTCAGCCTT TGTGCTGATT 
ATAGGATTGC TGTTTGCCCG TACCGGAAGA AACATGAAAT CTTTCTTTGC AGGCGGGGAG
GCCGTGCCCT GGTTTATCGG AGGCTTGTCC CTTTTTATGA GTTTCTTTTC TGCGGGTACT
TTTGTGGCCT GGGGTAGTAT TGCTTATAAG TATGGTTTTG TATCAGTCAC GATACAATGG
ACCATGTGTC TCGGCGGCAT TATTACTGCT TTTTTGTTAG CCCCCCGCTG GAAGCGTACA
GGCGCTCTGA CGGCAGCAGA ATTTATTCGT CAGCGGCTCG GCGCACCGGT ACAGAAATTT
TATGTCAACA TATTCCTGCT GGTATCCCTG TTCAATAAAG GCGCTGTATT GTATCCCGTA
GCAAAACTGG TCAGCGTATC ACTAGGCTTT CCCCTGACTA CCTGTACGAT CGTATTAGGG
TTGATGATGA TCGCCTACAC TGCTATTGGT GGTTTATGGG CGGTGATGGT GACCGATATT
CTACAGTTTG TAATTCTTAC GGCAGCCGTG CTGATGGTCT TACCCTTATC GCTTGATGCA
GCAGGTGGAT GGACTAAGTT CACACAACAG GTACCCGCTG ATTTCTTTAA TGTCATCAAC
GGAGAATACT CCATCGGTTT CATTCTCGCA TTTGCGTTAT ACCATATCAG CTACATTGGC
GGTAACTGGA CTTTCGTGCA ACGTTATACA AGCGTAGATT CTCCGAAATC AGCAAAGAAA
GTAGCGTTTT TATTTGCAGG GTTATACCTG GTTAGTCCGG TGATATGGAT GCTGCCACCT
ATGATTTACC GTAGTATCGA TCCGGCATTA ACCGGATTGA ATACGGAGAA TGCCTATCTG
CAGGTTTGCC GCTTAGTGTT ACCACCGGGT CTCATGGGAT TGATGCTGAC GGGTATGTAC
TTCTCTACAT CCGCGACTGC GAATACCACC TTAAATGTTA CTTCTGCTGT TATCACCAAT
GATATTTATA AGGCATTGAT TAATCCCGGG GCCAGCGACC GGCAGCTGAT CCGTGTGGCG
CGCCTTTCCG GTTTATTACT GGGAATAGGG ATGATTGGTG TGGCATTGCT TGTACCGGCG
GCAGGAGGTA TGATTGAAGT GGTGCTCAGT ATTGCAGCTA TTACTGGCGG TCCTTCTTTA
TTGCCGCCAC TGTGGGCATT GTTCTCAAAG AAACTGACGG GCAAAGCGGC TTATATTATT
TCGGGCGCCA GTTTGCTGGT CAACCTGTTG TTTAAACTGC TTGTACCGGC ATTGACTACA
CTGAAGCTGA GCCGTTCCGA AGAAATGCTC TTGGGTGTAG GATTACCATT CCTGTTATTG
CTACTCTATG AATTTGTCAT CGCGCCTACA ACAGCGGCGA AAGAATACCA GGATTACCGC
ATATTCCGTA CACGACAGCA ACAGGACGCG GCAGCACAAT CCGCAGCAGA AAAAGAAGCC
ATCCATAAAC AGAACCTTTT CGGGCTGCGC GTGATTGCAT TTTCACTGGC ATTTACTGCG
CTGCTGCTCT ACGTATTGAG TATACTGGCC TCCAGCGGTA ATATGCTGGT AGCAATGATT
GCAAGCGCCG TATTACTGGC GGCTATTATT CCGCTGCGTG CCGTGCGGAA GATCCGTAAA
CAACCCGAAT ACATGCTACA AAATAACATC CCTGTAGAGG ATAGTGATGT ACACCATCAC
CTGAAAAACA GGGAATCAAA AAACGAAGCT TATGATGAAA GAACAAGGCA GTGA
 
Protein sequence
MMKSAIDTTV IFIFSAFVLI IGLLFARTGR NMKSFFAGGE AVPWFIGGLS LFMSFFSAGT 
FVAWGSIAYK YGFVSVTIQW TMCLGGIITA FLLAPRWKRT GALTAAEFIR QRLGAPVQKF
YVNIFLLVSL FNKGAVLYPV AKLVSVSLGF PLTTCTIVLG LMMIAYTAIG GLWAVMVTDI
LQFVILTAAV LMVLPLSLDA AGGWTKFTQQ VPADFFNVIN GEYSIGFILA FALYHISYIG
GNWTFVQRYT SVDSPKSAKK VAFLFAGLYL VSPVIWMLPP MIYRSIDPAL TGLNTENAYL
QVCRLVLPPG LMGLMLTGMY FSTSATANTT LNVTSAVITN DIYKALINPG ASDRQLIRVA
RLSGLLLGIG MIGVALLVPA AGGMIEVVLS IAAITGGPSL LPPLWALFSK KLTGKAAYII
SGASLLVNLL FKLLVPALTT LKLSRSEEML LGVGLPFLLL LLYEFVIAPT TAAKEYQDYR
IFRTRQQQDA AAQSAAEKEA IHKQNLFGLR VIAFSLAFTA LLLYVLSILA SSGNMLVAMI
ASAVLLAAII PLRAVRKIRK QPEYMLQNNI PVEDSDVHHH LKNRESKNEA YDERTRQ