Gene Noc_1575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1575 
Symbol 
ID3705794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1748846 
End bp1750342 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content54% 
IMG OID637738055 
ProductSodium/proline symporter 
Protein accessionYP_343584 
Protein GI77165059 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0675536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGTCA CTTTTATCAT TTACCTCGCC GTCATGCTGC TGCTTGGCCT GCTGGCCTAC 
GTCCGGACCC GGGATTTATC CGACTATATC CTTGGGGGCC GCAAGCTCGG AAGTTGGACC
ACTGCCTTGA GTGCAGGGGC TTCTGACATG AGTGGTTGGC TCCTTTTGGG ACTACCTGGC
TACGCTTATC TTCATGGCTT GGAAGCTGGC TGGATTGCCC TCGGTTTATG GTTGGGCACC
TATGGCAACT GGCGCCTGGT GGCTGCCCGG TTGCGGGCCT ATTCCACGGC TGCTGGAGAT
TCACTCACCT TGCCGGAATT TTTGGCTCAG CGCTTTCATG ATCATCGCCA TCTGCTACGC
TGCGTTGCGG CAGCTTTCAT TCTGATTTTT TTCTTGTTTT ATACCAGTGC AGGCTTGGTG
GCGGGGGGAA AACTGTTCAA CGCCGTCTTC GGTCTACCCT ATACGTGGGC CGTAACCGCC
GGTACCCTGG CCATTATGAG CTATACTTTC GTCGGGGGAT TTCTCGCCGT CTCCTGGACC
GACCTGGTGC AGGGCCTATT GATGCTGCTC GCCCTGCTCG CCGTTCCTGT GGTCGCCATT
CTCCACCTGG GGGGCTGGCA GGCCACCTTG GCCGGCATTG ATGCTGAACG GCTCTACCTG
TTTGTAGCGA CGACCGATGA GCCCCTAGGG GCTATCGCTA TTCTCTCCCT GCTAGGATGG
GGGTTAGGGT ATTTCGGCCA ACCTCATATT CAGGTTCGTT TCATGGCCAT TCATTCCCTT
AACGCCGTGC CCCAGGCGCG GCGGATCGCC ATGGGTTGGG TGACCCTGAC CTTAATCGGC
GCCCTCTTGG TGGGAATTAC GGGAGCCTCC TTCTTACATC CACCCCTTTC CACGGCTGAC
AGCGAAAAAG TATTCATTGA AATGGTCAGT ATCCTCTTCC ACCCCTTGCC TGCCGGAGTC
CTACTAGCCG CTATCCTGGC AGCTATAATG AGCACCACCG ATTCCCAACT ATTGGTCTGC
TCGGCGGTAT TTACCGATGA CTTTTATAAA GCCCTCCTGC GCCATCAAGC AAGTGCTCGG
GAATTGGTCT ATGTGAGCCG GGCCACCGTC GTCATCATTG CTTCCCTGGC ATTATGGCTA
GCCCTCGATC CGGAAAGCCA AGTTTTGGAA CTAGTGGCCT ATGCCTGGGC TGGGTTTGGC
GCTGCTTTTG GCCCCACCCT CCTCATGGCC CTTTACTGGA AACGAATGAC CCGACAAGGA
GCACTAGCTG GAATTATCGT GGGAGGAATG ACGATTTTAT TATGGAAGCA GTTAAACGGG
GGAATTTTTG ACCTCTACGA ACTGGTGCCC GGCTTTATTT TCTCAGCTAT CGCCATTGTC
GGCGCTAGTT TGCTAAGCAC CGCCCCTGGA ATCGAGATCG AACGGCAATT TAATGCGATT
GCTAACAATA CCAAGCCTTT TAATACAAAC AATAATACTG AGGATCGCCA AACCTAA
 
Protein sequence
MIVTFIIYLA VMLLLGLLAY VRTRDLSDYI LGGRKLGSWT TALSAGASDM SGWLLLGLPG 
YAYLHGLEAG WIALGLWLGT YGNWRLVAAR LRAYSTAAGD SLTLPEFLAQ RFHDHRHLLR
CVAAAFILIF FLFYTSAGLV AGGKLFNAVF GLPYTWAVTA GTLAIMSYTF VGGFLAVSWT
DLVQGLLMLL ALLAVPVVAI LHLGGWQATL AGIDAERLYL FVATTDEPLG AIAILSLLGW
GLGYFGQPHI QVRFMAIHSL NAVPQARRIA MGWVTLTLIG ALLVGITGAS FLHPPLSTAD
SEKVFIEMVS ILFHPLPAGV LLAAILAAIM STTDSQLLVC SAVFTDDFYK ALLRHQASAR
ELVYVSRATV VIIASLALWL ALDPESQVLE LVAYAWAGFG AAFGPTLLMA LYWKRMTRQG
ALAGIIVGGM TILLWKQLNG GIFDLYELVP GFIFSAIAIV GASLLSTAPG IEIERQFNAI
ANNTKPFNTN NNTEDRQT