Gene Noc_1385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1385 
Symbol 
ID3706110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1533903 
End bp1535630 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content48% 
IMG OID637737879 
Producthypothetical protein 
Protein accessionYP_343408 
Protein GI77164883 
COG category[S] Function unknown 
COG ID[COG4805] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA ATTTTTCTCC TTTTTTAGGG AGTGCAGTCC TTTTATTTTT GACTTTATTG 
TTATTATTCC CAGGCATTCC AGAGACACGA GCAGCAGAAA ATAATACTCA TTGGGATGCG
TTTGTTCACA ATTTTGTCGA AAAATATTTT GCTGCCAATC CTGACTTTGC GGTACGTGCC
GGTCGACATG AATTTGATGG CAAGTTGCCT GATTGGAGCC CCGAAGCGCT TGCTAAAGAA
GTAGCGCGAC TGCGGTCGGA GCGCCAGCGG GCGCTTGCTT TCGAGGTTGC TTCCTTAACA
GCAAGCCAGC GTTTTGAACG TGATTATTTG GTTGCCTGGA TTGATAAGGA TCTCTTCTGG
TTGGAAACAG CGGAGTGGCC CTATAGAAAT CCGGCATTTT ATACCCAAGA ACTTGATCCT
AATGTTTACC TAAGCCGTCC TTATGCCCCC TTAGAAGAGC GTATGCGTGC TTATATTGCT
TATGCTGAAG CTATTCCTGC CGCAGCCAAG CAAATTCGCC ATAACTTGAG AACCCCACTG
CCCCGAACCT ATGTGGATAT TGGCGAGAAG GTGTTTGGGG GACTTGCCGC CTACTATGAA
CGAGATGCGC CCGCCATTTT CAGCACTGTG GAGAATGAAC GGCTACAAAG AAAATTTCGG
GCAGCCAACC GGCATGCCAT TCGCGCGATG AAGGAATTGC AGCAATGGCT GCAGACCCAG
CGGACCAATG CCACTAGTGA CTTTGCCTTA GGCGCGCCTC TTTTTCGTGC TCTATTGCGC
GAGGCTGAGG GCGTGAAGAT TTCCCTTGAG CGGCTAGAAC AAATAGGGCG CCAAGATCTT
AAGCGTAATC TTGTCGCTTT ACAGAAAGCA TGCGGTAATT ATGCTCCTAG CAAAACCGTA
TCGGAATGCA TTGAAAAAGC ACGAGCCGTA AAACCTGAGA AAGGTCCTGT CGAAGAAGCT
CGCCGCCAGC TCCAGAAACT TAAGGAGTTT GTGATTGCTA AAGATTTAGT GACTATTCCT
AGTGCCGAGC AAGCCCAGGT GGCCGCCTCT CCCCCTTATA TGCAATGGAA TTTTGCTTAT
ATTGACATTC CTGGCCCTTT TGATAAAGGG CTGCCTGCTA TTTATTATGT GGCGCCTCCT
GATCCGGCCT GGTCAAAAGC GGAACGGGAG GACTACCTTG CGGATAAGGC GGACTTACTA
TTTGTATCGG TGCATGAAGT TTGGCCAGGC CATTTTCTAC AATTTTTGCA TTCCAATCGG
GTTGCCTCCC CCTTAGGAAA ACTTTTTGTG GGTTATGGTT TTGCTGAGGG CTGGGCCCAC
TATGTGGAAG AGATGATGTG GAAAGCTGGA CTAGGTCATG GCGACCCGGA AATCCATATC
GGCCAATTGC TTAATGCCCT ATTGCGGAAT GTGCGTTATT TGTCCGCGAT AGGATTGCAC
ACTCAAAGGA TGACTTTGGA AGAGTCGGAG CGCATGTTCC AGGAATTTGC TCATCAGGAT
GTAGGTACGG CAAGGCAACA GGCAGCGCGG GGAACGTTTG ATCCAGCTTA CATCACCTAC
ACCCTAGGGA AATTGATGAT TAAGAAGTTA CGGGAAGAAT GGACCGCTAC CCGAGGAGAA
CGCGAGGGAT GGAGGGTATT TCATGATAAA TTTCTTTCCT ATGGTGGGCC GCCTATCCCC
TTAATTCGGA AAGAGATGCT TGGAGAAAAT GCGGGTCCTG CTCTTTAA
 
Protein sequence
MRKNFSPFLG SAVLLFLTLL LLFPGIPETR AAENNTHWDA FVHNFVEKYF AANPDFAVRA 
GRHEFDGKLP DWSPEALAKE VARLRSERQR ALAFEVASLT ASQRFERDYL VAWIDKDLFW
LETAEWPYRN PAFYTQELDP NVYLSRPYAP LEERMRAYIA YAEAIPAAAK QIRHNLRTPL
PRTYVDIGEK VFGGLAAYYE RDAPAIFSTV ENERLQRKFR AANRHAIRAM KELQQWLQTQ
RTNATSDFAL GAPLFRALLR EAEGVKISLE RLEQIGRQDL KRNLVALQKA CGNYAPSKTV
SECIEKARAV KPEKGPVEEA RRQLQKLKEF VIAKDLVTIP SAEQAQVAAS PPYMQWNFAY
IDIPGPFDKG LPAIYYVAPP DPAWSKAERE DYLADKADLL FVSVHEVWPG HFLQFLHSNR
VASPLGKLFV GYGFAEGWAH YVEEMMWKAG LGHGDPEIHI GQLLNALLRN VRYLSAIGLH
TQRMTLEESE RMFQEFAHQD VGTARQQAAR GTFDPAYITY TLGKLMIKKL REEWTATRGE
REGWRVFHDK FLSYGGPPIP LIRKEMLGEN AGPAL