Gene Noc_2446 details


Gene Information

Locus tag: Noc_2446
Symbol:
ID: 3704603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2791077
End bp: 2792552
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 53%
IMG OID: 637738926
Product: Sodium/sulphate symporter
Protein accession: YP_344430
Protein GI: 77165905
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0471] Di- and tricarboxylate transporters
TIGRFAM ID: [TIGR00785] anion transporter
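
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the record.

```python
# Values copied from the gene information above.
start_bp = 2791077
end_bp = 2792552

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```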


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGAAA CTGCCACTAC CCCCCTTGCC CATCGCCGCC AATACCTTGG ATTTATCCTT 
GGTCCCTTAG TGATGATTTC TCTCCTAACG CTATCGCCCC CCACCGGCCT GGATATTGCC
GCTTGGCAAA CCGCCGCCAT TACTTTGCTA ATGGCTATCT GGTGGATTAC AGAAGCCTTA
CCCATTCCCG TTACCGCCCT CCTGCCTCTG GTTCTTTTTC CTCTGCTCGG CATTGCCGAT
ATCCAAGAAA CGGCAATTCC TTATGCCAAT CCACTTATCT TTTTGTTTAT GGGAGGTTTC
ATCATTGCAT TAGCAATAGA GCGCTGCGGC CTGCATCGGC GTCTTGCTTA TTGTATTCTC
AGCTGTATGG GGATCTCACC TGCAGCTACC CTGGGCGGCT TCATGGTTGT CAGCGCTTTT
CTCAGTCTCT GGATTAGCAA TACCGCCACC ACCATGCTCA TGCTGCCCAT CGCCTGCTCC
GTGATCGAAG GACTTCGAGG CACGAAAGCG GCTGCCCCCC TATCTTCTCC CTTTGCAGTA
GCCCTGCTGC TAGGCATTGC CTACAGCGCC AGCATAGGAG GCTTGGGAAC GCTGGTAGGG
ACGGTGCCGA ACGCGTTACT AGCAGGCTTT GTCTTGGAAA CCTACGGCTT TGAATTAGGG
TTTGCAGAAT GGATGCTGCT CGCCGCACCT TTTTTGTTGG TTCTCCTCGT CTTTGCCTGG
TTCATGCTTA CCCAAGTCGT ATTTTCTGGT TTTCACAGGG AACATGGCGA CCAGCGGGTC
CCGATGCGCC AAGCTCTAAG GCAGCTTGGT CCTGTATCTC GGGCTGAACG AAGAGTCACC
ATGGTTTTTG CCTTAGTCGC TATACTTTGG TTGCTACGCC CCCTAATAGC AACCCTCGCA
CCTAACGTGG CTCTTAATGA TCCAGGGATC GCTCTCTTTG GCGCCATGCT CCTGTTCCTT
ATTCCCCTCG ATTGGCGGCG GGGGCGTTTT CTCATGGATT GGAAGACTGC GGAGCAATTG
CCCTGGGGTA CCTTATTACT ATTTGGAGGA GGACTCAGTC TAGCTGCCAG TATCAACAAT
ACGGGCCTTG CCAGCTGGCT TGGCGGAAAC CTTACCCTAC TCTCCGGCCT GCCCGCTTGG
CTAATATTGC TGGGTATCGT AGCAGTCGTG ATATTCCTTA CTGAACTGAC CAGTAATACG
GCCACGACCT CCATATTTCT CCCTATCCTT GGCTCCGTCG CCTTAAGCTT AGGCCAGCCA
CCATTGCAAC TCTTGATCCC TGTTACCCTA GCGGTCAGTT GCGCCTTTAT GATGCCGGTA
GCCACGCCCC CCAATGCCAT TGTCTTTGGC ACCGGCCTGG TGAGTATTCC CCAGATGGCA
CGGGCAGGAT TTTACCTTAA CTTAGCGAGT ATGATAATCA TCACCGGGGT TAGTTATTGG
TGGGTACCCG TTCTCTTTAC CTCCCCTTTA CCCTAG
 
Protein sequence
MPETATTPLA HRRQYLGFIL GPLVMISLLT LSPPTGLDIA AWQTAAITLL MAIWWITEAL 
PIPVTALLPL VLFPLLGIAD IQETAIPYAN PLIFLFMGGF IIALAIERCG LHRRLAYCIL
SCMGISPAAT LGGFMVVSAF LSLWISNTAT TMLMLPIACS VIEGLRGTKA AAPLSSPFAV
ALLLGIAYSA SIGGLGTLVG TVPNALLAGF VLETYGFELG FAEWMLLAAP FLLVLLVFAW
FMLTQVVFSG FHREHGDQRV PMRQALRQLG PVSRAERRVT MVFALVAILW LLRPLIATLA
PNVALNDPGI ALFGAMLLFL IPLDWRRGRF LMDWKTAEQL PWGTLLLFGG GLSLAASINN
TGLASWLGGN LTLLSGLPAW LILLGIVAVV IFLTELTSNT ATTSIFLPIL GSVALSLGQP
PLQLLIPVTL AVSCAFMMPV ATPPNAIVFG TGLVSIPQMA RAGFYLNLAS MIIITGVSYW
WVPVLFTSPL P
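
Both sequences are laid out in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch, not a reference implementation, of how the blocked text might be cleaned, how a GC fraction might be computed, and how the gene sequence might be translated for comparison with the protein sequence. The helper names (`strip_blocks`, `gc_fraction`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 permits are not handled here.

```python
# Standard codon assignments in NCBI order (T, C, A, G). Translation table 11
# uses the same codon-to-residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def strip_blocks(text):
    """Remove the spaces and line breaks used to lay out a sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence and the first two blocks of the protein sequence.
gene_excerpt = strip_blocks(
    "ATGCCTGAAA CTGCCACTAC CCCCCTTGCC CATCGCCGCC AATACCTTGG ATTTATCCTT"
)
protein_excerpt = strip_blocks("MPETATTPLA HRRQYLGFIL")

print(translate(gene_excerpt) == protein_excerpt)
```

Pasting the full gene sequence into `strip_blocks` and passing the result to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the complete protein sequence.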