Gene Syncc9902_0642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_0642 
Symbol 
ID3743464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp655426 
End bp656496 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content54% 
IMG OID637770814 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_376654 
Protein GI78184219 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family
[TIGR00742] tRNA dihydrouridine synthase A 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATG AAACTGAGCG GAGTGGTCTT GATCATCCCC AACCTTTGAA CCAAAGCCTG 
CAAACACCCA TTCATCTTCC GGGCAATGGC ACCTGCCGTC AGCTTCATTC GCGGGTCCTG
CAATCGCCTT TAGCGGGCGT AACCGACAAG ATTTTTCGTG GGTTAGTGCG GCGATGGTCT
CGGGATTCCT TGCTGTTCAC AGAAATGGTG AACGCCACAA GCCTGGAGCT CGGCCACGGT
TACGGAAAAA TGGACGACCT CAAGGCCGAG GAAGGCCCCA TCGCGGTTCA ACTCTTCGAT
CACCGCCCTC ACGCGATGGC CCACGCAGCA CGCCGTGCCG AACAGGCTGG GGCCTTTCTT
ATCGATATCA ACATGGGATG TCCAGTCCGA AAAATCGCGA AGAAAGGCGG AGGTTCAGGA
CTCATCCGTG ACCCGAATCT CGCCACTCAG ATTTTGGAGG CGGTCGTTGA CGCTGTATCG
ATTCCGGTCA CCGTAAAAAC GCGCCTCGGT TGGTGTGGGA GCAGTTCGGA CCCGATCACT
TGGTGCCGAC AACTTCAAAA TGCTGGCGCC CAAATGTTGA CCTTGCATGG ACGAACCCGG
GAACAGGGTT TCAAAGGCCA AGCGAATTGG AATGCGATCG CGGAAGTCAA GCAAGCCTTA
ACGATTCCCG TGATTGCCAA TGGAGATATC AACAGCCCCA CCGATGCACT TCGGTGTTTA
GAGATCACAG GAGCCGATGG CGTGATGGTG GGTCGCGGAA CCATGGGGTC ACCCTGGCTG
GTGGGACAAA TTGAAGCCGC ATTCATGAAT CGACCGATTC CTGCAACACC AGACAGTGGT
GCCAGACTTC ACTTAGCCAA AGAACAGTTG CACGATTTGG TGTCATCCAG AGGATCCCAC
GGCCTCCTCA TTGCACGAAA ACATATGAGC TGGACATGCA CTGGGTTCCC AGGAGCACCA
CAACTCCGCC ATTCCTTAAT GCGCGCAACA ACACCAGACG ATGCCTACCA ATTAATCGAT
AACGCCATCA ACTCCTTACG CACAGTCGGG ATCGCACAAG AAGCAAAATA A
 
Protein sequence
MADETERSGL DHPQPLNQSL QTPIHLPGNG TCRQLHSRVL QSPLAGVTDK IFRGLVRRWS 
RDSLLFTEMV NATSLELGHG YGKMDDLKAE EGPIAVQLFD HRPHAMAHAA RRAEQAGAFL
IDINMGCPVR KIAKKGGGSG LIRDPNLATQ ILEAVVDAVS IPVTVKTRLG WCGSSSDPIT
WCRQLQNAGA QMLTLHGRTR EQGFKGQANW NAIAEVKQAL TIPVIANGDI NSPTDALRCL
EITGADGVMV GRGTMGSPWL VGQIEAAFMN RPIPATPDSG ARLHLAKEQL HDLVSSRGSH
GLLIARKHMS WTCTGFPGAP QLRHSLMRAT TPDDAYQLID NAINSLRTVG IAQEAK