Gene Noc_1365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1365 
Symbol 
ID3706129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1512219 
End bp1513253 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content51% 
IMG OID637737860 
Productsolute/sodium symporter 
Protein accessionYP_343389 
Protein GI77164864 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID[TIGR00841] bile acid transporter 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAA CAGCTAATCG AATCACAGCT CTATTTCCAC TGCTGGCGCT CATGGGGGCT 
GGAGTGGCCT ACCAATATCC TGAACCCTGG GTAGTTCTCA AACCGGCTAT CGTACCCCTG
TTAGGTGTGA TCATGTTTGG CATGGGAATA ACTCTTAAAG CCAATGATTT CGTCCTAATT
CTCAAACAGC CCCAAGCTGT TGCGACGGGC GCCCTGTTAC AGTTTTTATT GATGCCTTTC
ATCGCCTGGA TAGTAAGCCA CCTTTTTAGC TTACCGGCAT ATCTTACCGT CGGCATGATT
CTGCTGGGAT GCAGCCCAGG AGGCACCGCC TCCAATGTGG TGTGTTATCT AGCCCGGGGT
GATGTGGCCC TCTCCATCAC CCTAACGGCT GCTTCCACGC TTCTATCGGT TCTTGCCACT
CCTTTTCTCA CCTGGCTTTA TGTGGGGCAG CAAGTCCCAG TGCCAGTAGC TGATATGCTG
CAAAGTATTT TGATAATCGT GCTGCTTCCT GTCACCTTGG GAGTTATCAT CAATACTTTT
TTCGGCCAAC GGCTAGGCAA GCTCACCGAT GTTTTTCCTG TCATTTCGGT CTTTGCCATT
GTGCTCATCG TGGCGATTAT CGTGGCCATT AACCAGGATA AACTGACCCT CATTGCTCCA
ACAATCGCCC TCTGCATCCT GTTACATAAT GGATTGGGCC TGGCAAGCGG TTATGGATTA
GCCCAGACCC TAGGTTTTAG TCAACGCCAA TCCCGCACCG TAGCCATTGA GGTAGGCATG
CAGAACTCAG GTCTAGCGGT GGCCCTGGCC TTAAAATACT TTACCGCCCA GGCCGCCCTT
CCCGGAGCTT TATTTAGTAT CTGGCACAAT CTCTCCGGGT CCCTGCTGGC TTACTACTGG
TCACACCGTT CCCAAGACTC CCCAGGGGAA CGATTAAAAG CCGACGCTCA TCCAGTCTGG
AAAAAGGCTT CCACCTCTTT AATATCCTGG CTTTGGGCTA TGCTTGGCAG GCTCTTCAGG
AACAAACGAC CATAG
 
Protein sequence
MATTANRITA LFPLLALMGA GVAYQYPEPW VVLKPAIVPL LGVIMFGMGI TLKANDFVLI 
LKQPQAVATG ALLQFLLMPF IAWIVSHLFS LPAYLTVGMI LLGCSPGGTA SNVVCYLARG
DVALSITLTA ASTLLSVLAT PFLTWLYVGQ QVPVPVADML QSILIIVLLP VTLGVIINTF
FGQRLGKLTD VFPVISVFAI VLIVAIIVAI NQDKLTLIAP TIALCILLHN GLGLASGYGL
AQTLGFSQRQ SRTVAIEVGM QNSGLAVALA LKYFTAQAAL PGALFSIWHN LSGSLLAYYW
SHRSQDSPGE RLKADAHPVW KKASTSLISW LWAMLGRLFR NKRP