Gene Noc_1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1010 
Symbol 
ID3707271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1117373 
End bp1118662 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content55% 
IMG OID637737515 
Productmajor facilitator transporter 
Protein accessionYP_343048 
Protein GI77164523 
COG category[R] General function prediction only 
COG ID[COG2270] Permeases of the major facilitator superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAAT CTGGCCATCA TTCAACGCCC GCTTCTAAAG CAAGCCTTAT CTCCTGGGCT 
TTATACGATT GGGCCAATAG CGCCTTTGCC GCCGTTATCA CCACCTTTGT GTTCGCCGCC
TATTTTACCC GGCAAGTGGC GGAAAACGAG ACTCTTGGCA GCGCCCAATG GGGGAACATA
GTGGGCATCT CCGGGCTTGT TATCGCCATT ACGGGACCGC TCCTGGGGGC CATTGCCGAC
CAAGGGGGAC GCCGCAAGCC CTGGATTATC GTCTTCACCT TATTGTGCGT TATAGCCACG
GCGCTTCTAT GGTTTATCAA ACCTACGCCT GACTATGCCT GGCTGGCACT GCTACTAGTT
GGGCTAGGCA CCCTCGGCGC TGAATTTGCT TTCATCTTTT ACAATGCCAT GCTGCCCGGC
TTGGCGGGAC CGAAATATGT AGGGCGGTGG TCCGGCTGGG GCTGGAGTAT CGGCTATGCA
GGTGGCGTAG CCTGTCTAAT CGTCGCCCTC TTTGCCTTCA TCCAAGGGGG AAATCATTGG
TTTGGCCTGG ACCCCGATTC CGCTGAGCCT GTGCGCGCTA CCTTTCCCCT GGTCTCCGGG
TGGTACTTAC TGTTTGCCCT CCCCTTGTTT CTCATCACAC CCGATACCCA AGGCACCGGC
AAACCCCTCT GGCGGGCAAC GAAAGATGGA ATGAGGCAGC TTTATGACTC CATTCGCCAT
GTACGCCAGT ACAGCACTAT CGCTCGCTTC CTTATTGCAC GCATGTTTTA TATCGACGGT
CTGGCAACTT TGTTTGCTTT TGGCGGTGTC TATGCGGCCG GAACCTTCGA CATGGACGAG
CAAGAAATAC TCCTGTTTGG AATCGCCCTT AACGTCACTG CTGGCCTGGG AGCCGCGGCT
TTTGCCTGGA TAGACGACTG GATAGGCAGC AAAAAGACCA TCCTGTTATC CCTGATTAGC
TTGATTTTGC TGACCACCCT GATCCTGATC GTGGAAACCT CGACCCTCTT TTGGACCTTT
GGACTCCTGC TCGGAATATT TGTGGGACCG GCCCAAGCCG CAAGCCGATC TTTTTTAGCA
CGAGTGGCGC CAGAGTCCTT GCGCAATGAA ATGTTCGGCT TGTTTGCCCT TTCTGGCAAA
GCGACCGCCT TCCTAGGTCC CTTATTGGTG GGCTGGATCA CTTACCTGGC GGGCAGCCAG
CGAATTGGCA TGGGCGCTAT CGTCATTTTT CTTCTCGTTG GCTTTGTGCT AATGCTGACC
GTCCCAGCCG CTAAAAAACC AGAAGAATAG
 
Protein sequence
MTQSGHHSTP ASKASLISWA LYDWANSAFA AVITTFVFAA YFTRQVAENE TLGSAQWGNI 
VGISGLVIAI TGPLLGAIAD QGGRRKPWII VFTLLCVIAT ALLWFIKPTP DYAWLALLLV
GLGTLGAEFA FIFYNAMLPG LAGPKYVGRW SGWGWSIGYA GGVACLIVAL FAFIQGGNHW
FGLDPDSAEP VRATFPLVSG WYLLFALPLF LITPDTQGTG KPLWRATKDG MRQLYDSIRH
VRQYSTIARF LIARMFYIDG LATLFAFGGV YAAGTFDMDE QEILLFGIAL NVTAGLGAAA
FAWIDDWIGS KKTILLSLIS LILLTTLILI VETSTLFWTF GLLLGIFVGP AQAASRSFLA
RVAPESLRNE MFGLFALSGK ATAFLGPLLV GWITYLAGSQ RIGMGAIVIF LLVGFVLMLT
VPAAKKPEE