Gene Jann_2236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2236 
Symbol 
ID3934690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2240715 
End bp2241758 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content65% 
IMG OID637904593 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_510178 
Protein GI89054727 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.64486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCTTCCC CGATGTCCCG CAGTATCCAC CAAGAGACCG CCCAAGGATT AAGCGTTTTG 
CCCGTAACCC TCGCCGACAT CGCACTAGAC CCCCCTGTCC TGCTCGCACC GATGGCGGGG
ATCACGGATC TGCCGTTTCG CAGGCTGGTG GCCAGCTTTG GCGCCGGGCT TGTGGTGAGC
GAGATGGTGG CGTCGCAGGA AATGGTGGAA GCGAAGGCCT CCGTCCGGGC GCGGGCCGAG
CTTGGGTTCG GTGAGAACGC CACCGCCGTG CAACTGGCAG GCCGTGAGGC CCATTGGATG
GCGGAGGCCG CCCGGATCGC CGAGGGGCAG GGCGCGCGGA TCATCGATAT CAACATGGGT
TGCCCGGCCA AGAAGGTCGT TGGCGGGCTG TCAGGCTCTG CGCTGATGCG CGACCTCGAC
CATGCCTTAC GCCTGATTGA CGCCGTGGTG GGCGCCGTGA ATGTGCCCGT CACGCTAAAG
ACGCGGCTGG GGTGGGATGA TAAGACGCTG AATGCGCCTG CTCTCGCCCA GAGGGCAGAA
ACCGCTGGTA TCCGGATGAT TACCATCCAT GGCCGCACCC GGTGCCAGTT CTACAAGGGC
ACCGCAAAGT GGGGCGCGAT CCGTGCCGTG AAAGACGCCG TGTCGATCCC CGTCATCGCC
AATGGCGACA TCACCGATGC GCCTGCCGCC GCGCAAGCCC TGCGCTGTTC CGGGGCCGAT
GGCGTCATGG TGGGACGTGG CATTCAGGGG CAGCCCTGGC TGGTGGCGGA GATCGGCGCG
GCTTTGTTCG GCACACCGAC GCCCGTCGCG CCCCAAGGCG ATGCACTGGT CGCGATGGTC
GCGGCGCATT ACGAAGACAT GCTGGCGTTT TATGGCCGTG ACTTGGGCGG CAAAGTCGCG
CGCAAGCATT TGGGATGGTA CATGGACGGC GCGGGAACCC CGCCCGCATT GCGCAAAGAG
GTATTGACCA CCAAAGATCC CGGTGCCGTG TTGAAGGCCC TGCCGCTGGC CCTCGCACAA
ACGTCCCAAA GGGCCGCCGC ATGA
 
Protein sequence
MSSPMSRSIH QETAQGLSVL PVTLADIALD PPVLLAPMAG ITDLPFRRLV ASFGAGLVVS 
EMVASQEMVE AKASVRARAE LGFGENATAV QLAGREAHWM AEAARIAEGQ GARIIDINMG
CPAKKVVGGL SGSALMRDLD HALRLIDAVV GAVNVPVTLK TRLGWDDKTL NAPALAQRAE
TAGIRMITIH GRTRCQFYKG TAKWGAIRAV KDAVSIPVIA NGDITDAPAA AQALRCSGAD
GVMVGRGIQG QPWLVAEIGA ALFGTPTPVA PQGDALVAMV AAHYEDMLAF YGRDLGGKVA
RKHLGWYMDG AGTPPALRKE VLTTKDPGAV LKALPLALAQ TSQRAAA