Gene Noc_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1600 
Symbol 
ID3705762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1779335 
End bp1780756 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content55% 
IMG OID637738076 
ProductNa+/solute symporter 
Protein accessionYP_343605 
Protein GI77165080 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0101485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCATCG TCCTCAATTT TTTATTCTTT TTAGCGATTT TTGCGGGAGT TGGTTTGTTG 
TCGGCACGCA AGTCCCAAGG CACCCGCCAT GATTACTATA TGGCCAACAA TTCGGTAAAA
CCTTGGCTAG TGGGTTTATC GGCGGTCGCC ACTAACAACA GCGGCTACAT GTTTATCGGG
GTTATCGGTT ACACCTATCT GACCGGTTTG GCGTCGCTAT GGCTGGTGAT CGGCTGGATT
CTGGGGGACT TTATCGCCTC CCAATTGGTG CATCGGCATC TGCGGGAAGC CACGGTTCGC
ACCGGCGAGG TCACCTATGG GGGCGTGCTG AGTCAGTGGT ATGGCGCCGA ATTGGCGGGA
CTGCGGCGAA TAGCAGGCCT TCTCACGGTC ATTTTTCTGG GTATCTACGC GGCAGCTCAG
CTAAATGCTG GCAGTAAGGC CCTGCATGTA TTATTCGACT GGCCTTTCTA CGCGGGCGCA
GTGATCGGCG CGGTGCTAGT AGTGGGTTAT TGCTTTGCCG GCGGCATCCG TGCTTCCATC
TGGACGGATG CCGCCCAATC CTTTGTCATG TTCGGCGCCA TGCTGACCTT GCTTTACGCG
GCCGTCATGG CTTTGGGGGG CCCGCAAGGC GCCTGGGGAG AAATGGGTAA AATCAAAGGC
TTTCTGGACT GGTCCCCCGC AGATACGCTG ATTCCAGGGA TGGCGGGGCT TGCCTTTTTT
GCTCTGGGCT GGTTTTTTGG TGGTTTTTCC GTGGTGGGCC AGCCCCACAT CATGATCCGT
TTTATGGCCC TGGATAATCC TAGCCATATG GCCCGGGCGC GGCTCTATTA TTATCTCTGG
TATACCCTAT TCTACCTGTT GGCCACAGGC GTGGGCATGC TCTCCCGGGT GTATTTACCA
GAAGCACAGA ACTTTGACCC GGAACTGGCC CTGCCCACCA TGGCCCTGCA ACTATTACCT
GATATGCTGG TAGGATTGAT CCTGGCCGGT ATTTTTGCAG CCACCATGTC CACGGCGGAT
TCCTTGATAT TATCTTGTTC AGCGGCCCTT ACCCATGATT TACTGCCCCA CCAATTTGAG
AATATGGGCA AGATAAAGCT GGCCACGGTC GTGGTAACGG CCCTAGCCTT AGCCATTGCT
TTGAGCAGCA ACGAAAGCGT GTTTACGTTG GTGATTTTGT CTCTCTCCTT CCTGGCATCA
GCTTTTGTGC CTTTATTATT GATTTACACT CTCGGCGGCC AGCCCACGGA CCGGCAGGCC
TTAATCATTT TAGGAGCGGG CCTCGGCGTG GCCATAGTCT GGCGCTGGCT GGGCTTCCAT
CACGCCCTCT ACGAAGGGAT GCCCGGCATT CTGGCGGGCC TACTGGCTTT TGGCATGCTG
CGACTGTTCG GAAAAGTCGC TAGAAGCTTG GTAAGATCAT AG
 
Protein sequence
MIIVLNFLFF LAIFAGVGLL SARKSQGTRH DYYMANNSVK PWLVGLSAVA TNNSGYMFIG 
VIGYTYLTGL ASLWLVIGWI LGDFIASQLV HRHLREATVR TGEVTYGGVL SQWYGAELAG
LRRIAGLLTV IFLGIYAAAQ LNAGSKALHV LFDWPFYAGA VIGAVLVVGY CFAGGIRASI
WTDAAQSFVM FGAMLTLLYA AVMALGGPQG AWGEMGKIKG FLDWSPADTL IPGMAGLAFF
ALGWFFGGFS VVGQPHIMIR FMALDNPSHM ARARLYYYLW YTLFYLLATG VGMLSRVYLP
EAQNFDPELA LPTMALQLLP DMLVGLILAG IFAATMSTAD SLILSCSAAL THDLLPHQFE
NMGKIKLATV VVTALALAIA LSSNESVFTL VILSLSFLAS AFVPLLLIYT LGGQPTDRQA
LIILGAGLGV AIVWRWLGFH HALYEGMPGI LAGLLAFGML RLFGKVARSL VRS