Gene Syncc9605_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_2030 
Symbol 
ID3737706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp1846747 
End bp1847748 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content66% 
IMG OID637776616 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_382325 
Protein GI78213546 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGCTC TACCCCCTCT GCAGCTACCC GGCAATGGCA TCGCACGTCA GCTGCGCTGC 
CGCGTGTTGC AGTCGCCGCT GGCAGGGGTG AGCGATCGAG TGTTTCGCAG CCTGGTTCGA
CGCTGGGCGC CCGATGCCCT GTTGTTCACC GAAATGGTGA ATGCCACCAG CCTCGAGATG
GGGCATGGAC TGTGCAAGGT GGAATCGCTC GCCGAGGAAT CCGGCCCCAT CGGCGTGCAA
CTGTTCGACC ATCGCCCCCA GGCCATGGCC GATGCGGCAC GACGGGCCGA AGCCAGTGGC
GCCTTTCTGA TCGACATCAA CATGGGCTGC CCGGTGCGGA AGATTGCCCG CAAAGGGGGC
GGTTCCGGGT TGATCCGTGA TCCCGGGCTG GCCATTCAGA TCGTGGAAGC GGTGGCGGAC
GCGGTGGCCG TGCCTGTCAC GGTGAAGACA CGCCTGGGTT GGTGTGGCAG TGATGCCGAT
CCCGTGCACT GGTGCCAGCA ATTGGAACAA GCCGGGGCAC AACTTCTCAC TCTGCATGGA
CGCACCCGCG AGCAGGGCTT CAAGGGTGCC GCCGACTGGA GCTCCATCAG GCAGGTGCGG
GAGGCCCTCA CGATCCCGCT AATCGCGAAC GGCGACATCA ACAGCCCCGA CGATGCCCTG
CGCTGCCTGA AACAGACCGG CGCAGCGGGC GTGATGGTGG GCCGAGGCAC GATGGGGTCC
CCATGGTTGG TGGGTCAGAT CGACGCCGCC CTAGCCGGTC GCTCGATCCC CGCCACGCCG
GATCCCTCAG CACGACTTGC GCTGGCCCGC GATCAATTGG ATGGCCTCGT GCAGGATCGC
GGTGACCACG GGCTGCTGAT TGCCCGCAAA CACATGGGAT GGACCTGCAC GGGCTTCCCC
GGCGCCTCGC GACTGCGTCA TGACCTGATG CGGGCACCCA CACCCGCCCA GGCCAGGGAT
CTGCTTACTC AGCAGATCGA TGCCCTTGCC GCGTCCGCTT GA
 
Protein sequence
MIALPPLQLP GNGIARQLRC RVLQSPLAGV SDRVFRSLVR RWAPDALLFT EMVNATSLEM 
GHGLCKVESL AEESGPIGVQ LFDHRPQAMA DAARRAEASG AFLIDINMGC PVRKIARKGG
GSGLIRDPGL AIQIVEAVAD AVAVPVTVKT RLGWCGSDAD PVHWCQQLEQ AGAQLLTLHG
RTREQGFKGA ADWSSIRQVR EALTIPLIAN GDINSPDDAL RCLKQTGAAG VMVGRGTMGS
PWLVGQIDAA LAGRSIPATP DPSARLALAR DQLDGLVQDR GDHGLLIARK HMGWTCTGFP
GASRLRHDLM RAPTPAQARD LLTQQIDALA ASA