Gene Noc_0856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0856 
SymboltruD 
ID3707161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp936678 
End bp937754 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content55% 
IMG OID637737358 
ProducttRNA pseudouridine synthase D 
Protein accessionYP_342899 
Protein GI77164374 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGCGG ATGAGGCTGG GCAGCAAGTG CTTGCCTATG GCGGAGATCC GCCTTTGGCA 
ACCGCCCTGC TTCGGTGTCG CCCAGAGGAC TTCCAGGTGG TTGAGGAACT TCCCTTCGCT
CTCTCTGGGG AGGGCGAGCA TGTCTGGCTT CTGCTTTGTA AACGTAACAC TAATACTGTC
TGGCTAGCGC GCCAGCTTGC CCGCATTGCC GGAGTGCGGC TAGTAGATGT GGGTTACGCA
GGGCTAAAGG ATCGTCATGG GCTGACCACC CAATGGTTTA GCGTTAATTT GAGTGGAAAA
AAAGAGCCAG CCTGGGCTAC AGCGTTGGAG TCTGCCACGG TTCAAGTGCT TAAGGTTATC
CGCCATTCCC GAAAATTACA GCGGGGCGCG CTCAAGGGAA ACCGTTTTCT ATTGACCTTG
CGCCACTTCC AGGGTGATCG GGAGGTTGTT TGCGACCGCC TGACACAGAT TAAAGTTGCG
GGGACTCCCA ATTACTTTGG ACCGCAGCGT TTTGGCCGGG GGGGCCAGAA TCTGGATCAG
GTGCACCGTT GGTTTAGTGG AGGCAAGCCA CCCAGGGGGC GTTATTTACG GGGAATGCTG
CTTTCGGCAG CCCGCGCTTT TTTATTTAAT AGGGTCTTGT CGGAGCGCGT CCAGGCAGCT
AATTGGTGGC AACCACTTCC AGGCGAGGCG CTTATTCTGG ATGGCAGCCA TGGCTTTTTT
GTAGCGGAGA CCATAGATGA AGCCTTGCAA GCCCGGGTGA GGCGCTTCGA CTGCCATCCC
AGTGGTCCTT TATGGGGGCG AGGGGAATCT CCCGCTAAGA GGATGAGCCG GGCTCTTGAG
GAAGAGGTAT TGGCGGATTA CGCATTATGG CGGGAAGGTC TGGAGCAGGC AGGCTTAAAG
CAAGAGCGCC GTAGTTTGCG TTTAATGGTA GCTGATTTGG AATGGTCTTT TCCTCCTGCT
ATGGATAGCT TGCAGCTTCA TTTTCGTTTA CCCGCTGGGG CTTATGCCAC CACTGTATTG
CGGGAAGTGG TCAGGACCCA AGAGGCGGTG GGACAGCCTT TCCTTTTAGA TGAATAA
 
Protein sequence
MEADEAGQQV LAYGGDPPLA TALLRCRPED FQVVEELPFA LSGEGEHVWL LLCKRNTNTV 
WLARQLARIA GVRLVDVGYA GLKDRHGLTT QWFSVNLSGK KEPAWATALE SATVQVLKVI
RHSRKLQRGA LKGNRFLLTL RHFQGDREVV CDRLTQIKVA GTPNYFGPQR FGRGGQNLDQ
VHRWFSGGKP PRGRYLRGML LSAARAFLFN RVLSERVQAA NWWQPLPGEA LILDGSHGFF
VAETIDEALQ ARVRRFDCHP SGPLWGRGES PAKRMSRALE EEVLADYALW REGLEQAGLK
QERRSLRLMV ADLEWSFPPA MDSLQLHFRL PAGAYATTVL REVVRTQEAV GQPFLLDE