Gene Noc_2215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2215 
Symbol 
ID3705095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2558305 
End bp2559465 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content54% 
IMG OID637738691 
Producthypothetical protein 
Protein accessionYP_344205 
Protein GI77165680 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCTATT TATTGCTTTT AATCATCAGA AACGCATTCC GCCAAAAGCT CCGCACCATC 
CTGACCATGG TGGGGATCGT CATTGCAACG GTGGCCTTTG GCCTGCTGCG CACGGCGGTT
GAGGCTTGGT ATGCGGGAGC GGAGGCAGCT TCGGCCACCC GCCTTATCAC CCGCAACGCC
ATCTCCCTAG TTTTTCCCCT GCCCCTGAGC TACCAAAACA AAATCCAGCA AATCGAGGGG
GTCGCCACCA CCAGCTATTC CAATTGGTTT GGAGGCGTTT ATATCAGCGA AAAGAATTTT
TTTCCCCAAT TCGCCATCGA ACCCCGCAGC TATCTTAAAC TTTATCCGGA ATATCTCCTC
TCGCCCCAAG AGGAAAGAGA TTTTTTTCGC GATCGCCAAG GCGCTATTAC AGGCGAAAAG
CTGGCGCAAA AATACGGCTG GAAAATCGGC GATGTCATCC CCATCCGGGG GACGATCTAC
CCCGGCGACT GGAATTTTAT CCTGCGGGGC ATTTATAAGG GAGCCAGCGA GAGAATTGAT
GAAACCCTCT TCCTTTTTCA CTGGGAATAT TTAAATGAAA AATTAAAACA AACCGGCGCT
GAACGGGCCA ATCACGTGGG CGCCTATGTG GTCGGCATTG AACAGGCCAG CCAGGCCGCC
CAAATCTCTC AAGCTATTGA CGGTCTTTTT GCCAACTCTC TGGCGGAAAC CCTCACGGAA
ACCGAAAAAG CCTTTCAGCT TGGTTTTGTC GCCATGACCG AAGTTATTGT GACCATTATT
GAGGTGGTTT CCTTCGTTAT TCTCATTATT ATTTTGGCGG TCATGGCCAA CACCATGGCG
ATGAGCGCCC GGGAACGCAA GCGGGAGTAC GCCACGCTCA AGGCACTCGG ATTTCCTGGC
AGTTTTATTG CTTTATTGAT TACCGGGGAG TCGATGGTGA TTGCCCTAGT GGGGGGTCTC
TTCGGACTGT TGCTGCTGTA TCCCGCGGCA GATAGCTTTG CCAGTAAAAT AGGCACTTTC
TTTCCCGTCT TCCGGGTGAC CCCGGAAACC GCCTGGCTGG CCATGGGCAT TGCCCTCGTG
GTGGGTCTCG CCGCCGCCGC CATTCCTGCC TGGCGGGGCG CGGCAGTATC GGCCACGGAA
GGGTTCCGGC AAATCGGTTA G
 
Protein sequence
MRYLLLLIIR NAFRQKLRTI LTMVGIVIAT VAFGLLRTAV EAWYAGAEAA SATRLITRNA 
ISLVFPLPLS YQNKIQQIEG VATTSYSNWF GGVYISEKNF FPQFAIEPRS YLKLYPEYLL
SPQEERDFFR DRQGAITGEK LAQKYGWKIG DVIPIRGTIY PGDWNFILRG IYKGASERID
ETLFLFHWEY LNEKLKQTGA ERANHVGAYV VGIEQASQAA QISQAIDGLF ANSLAETLTE
TEKAFQLGFV AMTEVIVTII EVVSFVILII ILAVMANTMA MSARERKREY ATLKALGFPG
SFIALLITGE SMVIALVGGL FGLLLLYPAA DSFASKIGTF FPVFRVTPET AWLAMGIALV
VGLAAAAIPA WRGAAVSATE GFRQIG