Gene Noc_1626 details


Gene Information

Locus tag: Noc_1626
Symbol:
ID: 3705690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1817219
End bp: 1818868
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 48%
IMG OID: 637738101
Product: sulphate transporter
Protein accession: YP_343630
Protein GI: 77165105
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
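
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 1817219
end_bp = 1818868
reported_gene_length = 1650      # bp
reported_protein_length = 549    # aa


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length == reported_gene_length)        # coordinates agree with Gene Length
print(protein_length == reported_protein_length)  # Gene Length agrees with Protein Length
```

Under those assumptions both comparisons hold for this record: 1818868 - 1817219 + 1 = 1650 bp, and 1650 / 3 - 1 = 549 aa.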


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACAGC ACATTGAAAC CCTTGAGGCT CCTAGAAAAG GCTTTGCCGG CTTGAAGGAA 
AATTTTGCTG CCGATTTAAG GGCGGGGCTA AGTATTTCTC TCATTGCCCT ACCCTTGAGT
CTAGGGATCG CTTTAGCTTC TGGTTTTCCG GCTTTTGCCG GTCTTATCGC AGCCATTGTA
GGCGGTATAC TTGTTTCCCG CATCAGTGGC TCTTATGTCA CCATTAATGG GCCGGCGGCC
GGGCTAATTG TAGCCAATCT GGCGGCTATC CAAAGTCTCG GCCAAGGGGA TATCCACGCT
GGTTATCTTT ACGCTATCGC CGCGGTCTTT GTAGCCGGCA TCATGGTTTT TATAATTGGT
GCCGCCGGGG CAGGAAAACT CGTAGATGTC TCCCCCAGTT CGGTGATTCA TGGCATGTTA
ACCTACATTG GGGTGGTTAT CATGGCCAAA ATGTTCTTTC CCATGATGGG GGTCATTCCT
GAAGTCCATT CAATTCTAGG CTCTAATGTG GTTGGCACCA TAGCCGCTAT TCCTCAAGGT
TTTACCAAAA TGCTGCCCCC TGTTGCCATC GTAGGGTTTG TTTCTTTGCT CATCATGGCT
ATCCACCCGA TGATTAGAGT CAAATGGGTG CAATTAATTC CAAGCCCGGT ATGGGTACTG
ATATTTGCTA TCCCCGCAGG CGTGCTTTTT GATCTGGAAA CCTTGCAGCA ACAACTCAAC
CTTCCAGAAG GGAAAGAACT GCTGCTAGCC CTACCGGGCA ACCCCCTCGA TGCCGTTGCT
TGGATCGGTG CGGTTACCCC GGATTTCGGT AAAATATTAA CCTGGGCTTT TTGGTACGCC
GCTCTCACCA TCGCCCTTAT CACGGCCATT GAATCCGGTC TTAGCGCCAA AGCGGTGGAT
CAATTAGATC CTTATCAACG GCACTCAGAT ATCGGCAAAG ATATCCGGGG AGTGGGTATA
GGCAGCGCCG TTTCCGGTAT TCTCGGCGGC TTGCCCATGA TCGCTGAAAT TGTCCGCAGC
AAAGCCAATG TCCTGATGGG GGCAAGAACC GGTTGGGCCA ATTTCTTTCA TGGAACCTTT
ATCCTGATTT TTGTTTTTGC ATTGTCCCCA GTCATGCAAA TGATTCCTGT TGCAGCACTG
GCAGCTATGA TGGTATTCGT GGGATATAAA CTTGCTGCTC CCGGCGAGTT CATTGGTATC
TTTAAAATTG GCCGGGATCA ATTTCTCTAT TTCATATTCA CCTTGCTTGT TTGTATCTTC
ACTAATCTGC TTGTCGGTGT TTTCGCTGGT ATTATTTTCA AATTCCTCTA CCAATTGCTC
GTGATGAGAG CGCCAACGTC TACTCTTTTC AAAGCAGATT TAACGGTAGA TCAAAGTGAT
GAGGGTAAGG ATGAATACCG GGTTAAAGTG AGAAAAGGAG CAACCTTTAC TAACTTTCTT
TCTTTTAAAC GCCGGTTAAG CCAACTACCA AAGGGCAAGA AAATCACGGT TGATTTCTCC
GAAGCTAAAG TAGCGGATTT CACCTTTCAA AGCGCGCTAC ACCATTATGC TAAACTCTAT
CAGGCAACTG GAGGATCAAT AGAACTAACT GGGCTCGATC AGCTTAAAGC CTACTCCAAC
CATCCTCAAT CAACTCGCTA TCGGCGTTAG
 
Protein sequence
MAQHIETLEA PRKGFAGLKE NFAADLRAGL SISLIALPLS LGIALASGFP AFAGLIAAIV 
GGILVSRISG SYVTINGPAA GLIVANLAAI QSLGQGDIHA GYLYAIAAVF VAGIMVFIIG
AAGAGKLVDV SPSSVIHGML TYIGVVIMAK MFFPMMGVIP EVHSILGSNV VGTIAAIPQG
FTKMLPPVAI VGFVSLLIMA IHPMIRVKWV QLIPSPVWVL IFAIPAGVLF DLETLQQQLN
LPEGKELLLA LPGNPLDAVA WIGAVTPDFG KILTWAFWYA ALTIALITAI ESGLSAKAVD
QLDPYQRHSD IGKDIRGVGI GSAVSGILGG LPMIAEIVRS KANVLMGART GWANFFHGTF
ILIFVFALSP VMQMIPVAAL AAMMVFVGYK LAAPGEFIGI FKIGRDQFLY FIFTLLVCIF
TNLLVGVFAG IIFKFLYQLL VMRAPTSTLF KADLTVDQSD EGKDEYRVKV RKGATFTNFL
SFKRRLSQLP KGKKITVDFS EAKVADFTFQ SALHHYAKLY QATGGSIELT GLDQLKAYSN
HPQSTRYRR
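
The relationship between the two listings can be illustrated by translating the final block of the gene sequence and comparing it with the end of the protein sequence. This is a minimal sketch, assuming the standard amino acid assignments of NCBI translation table 11 (which differs from the standard code only in its permitted start codons); `clean_sequence` and `translate` are hypothetical helper names, and only the last 30 bases of the listing are used here.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; third base varies fastest).
# The amino acid assignments are those listed for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to block the listing."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Final block of the gene sequence, copied from the listing above.
last_block = clean_sequence("CATCCTCAAT CAACTCGCTA TCGGCGTTAG")
print(translate(last_block))  # HPQSTRYRR
```

The nine residues match the last line of the protein sequence, and the tenth codon (TAG) is the stop codon. Applying the same two helpers to the full 1650 bp listing should give the complete protein, since the first codon here is the ordinary ATG start.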