Gene Noc_1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1942 
Symbol 
ID3705479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2222474 
End bp2223949 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content54% 
IMG OID637738418 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_343934 
Protein GI77165409 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGAAA GCAAGACTGA GCCTGGTGTT CGGCACTCGA TAGTGGTTAC AACGTTGAAC 
CGCTATGCAG TTTTAGTGAT TTCCTTGGTC TCCACGATGG TGTTGGCGCG CTTGCTTACG
CCGGCCGAGA TCGGCATTTT TTCCATGGCG GTGGTCTTCG TGAACCTTGC CCATTCCATG
CGTGACTTTG GCGTAGGCCG GTACATCGTT CAGGAGAAGG AGCTTACCGT AGATCGTATC
CGGTCCGCCT TCGGAATCAC GTTGGGTATC GCCTGGTCTA TGGCCATTGT GCTAGCAATT
GCAGCGCCTT GGGTTGCTGA TTTTTATGGG GATGAAAGGG TGACAGGAAT CCTGCGGGTG
CTTGCCGTTA ATTTCGTTTT AATTCCGTTT GGCTCCGTGG TTCTCTCTTA CTTGAACAGG
GAAATGCAGT TTACCACTAT TTTCCTGGTG GGCGTCATTA GTGAATTTGT CCGGGCTGCG
AGCGGTATTT GGTTTGCATG GATAGGACTT GGCGCTATGA GTTTAGCATG GAGCGCACTT
CTTGGGGTAA TTGCCACAGT AGTATTGGCG CGTATTCTAG GGCCTAGCCA CTTCATCCTG
CGCCCAGCAT TTTGTGAATG GCGGAGGGTT ATGAGCTTTG GCGGACGGGC CACGCTTGCA
ACCATTGCAT TCCAGTTTCA GCGGGGAGCG CCTGAGGTAG TCATTGGCCG GTATTTGAGT
GCTGCAGCGG TAGGTTTCTT CAGCAAAGCT CTTGGGGTTA TTCGGCTCTT CGACCGCACG
GTGCTATCGG CGGTAAGCCC GGCAATTTTG CCCCACATGG CGGCCAAGCA CCGCTCGGGA
GAGAGTGTCG CGGGTTTTTA TGCCCATGGG CTTGGGTTAA TTACCGCCCT CGCGTGGCCA
TGCTATGCTT TTATCGCCAT CATGGCGTTC CCGGTGGTAC GAATTCTGTT CGGCGATCAG
TGGGATGCTG CGGTTCCCCT GGCGCGCATT CTGGCCATCT ATGCGGCGGT CGATGCTTTG
TATGCGTTTA CCGCGCAGGC GCTGATTGCG GTCGGTGCGG TACACCTGCT GGTGCGGCTA
AGGGTGGCTA CCCTTTTAGC GACGGTTTTG GCGCTGGTGC TGGCTGTTTC TTATGGGTTA
GAAGTTGTTG CTTTTGCGAT GGTTTTTCCG GCTGTAGTGG GGCTGATCTA TTCCTCTTTG
CTGATGCGTT CAGCTATTGG TCTCAAGGGT AGGGTTTACC TAAAGGCCAC TGCCGCAAGC
TTATTGATTA CCGCTGCCAC AGTAGCGTTC CCCTTGTTCT ATCTGGGGAT GCCGGCGGCA
GTTGGGCAGC CCCATTGGCA ATTTTTTATT ATTAGTGCTG CCGGCGGCAG TGCAGGCTGG
ATGGTGGCTG TAATTACGCT TCGTCATCCT ATTTGGGACG AGTTGAGACT TCTATTTTCC
CAGGCTCGGA ATCGGTTATG GCCGGTTAGC AGTTAG
 
Protein sequence
MLESKTEPGV RHSIVVTTLN RYAVLVISLV STMVLARLLT PAEIGIFSMA VVFVNLAHSM 
RDFGVGRYIV QEKELTVDRI RSAFGITLGI AWSMAIVLAI AAPWVADFYG DERVTGILRV
LAVNFVLIPF GSVVLSYLNR EMQFTTIFLV GVISEFVRAA SGIWFAWIGL GAMSLAWSAL
LGVIATVVLA RILGPSHFIL RPAFCEWRRV MSFGGRATLA TIAFQFQRGA PEVVIGRYLS
AAAVGFFSKA LGVIRLFDRT VLSAVSPAIL PHMAAKHRSG ESVAGFYAHG LGLITALAWP
CYAFIAIMAF PVVRILFGDQ WDAAVPLARI LAIYAAVDAL YAFTAQALIA VGAVHLLVRL
RVATLLATVL ALVLAVSYGL EVVAFAMVFP AVVGLIYSSL LMRSAIGLKG RVYLKATAAS
LLITAATVAF PLFYLGMPAA VGQPHWQFFI ISAAGGSAGW MVAVITLRHP IWDELRLLFS
QARNRLWPVS S