Gene A9601_10981 details


Gene Information

Locus tag: A9601_10981
Symbol:
ID: 4717809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 938171
End bp: 939178
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 33%
IMG OID: 640078813
Product: nitrogen regulation protein NifR3 family-like protein
Protein accession: YP_001009489
Protein GI: 123968631
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
            [TIGR00742] tRNA dihydrouridine synthase A
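
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the record; the coordinate convention
# (1-based, inclusive) is an assumption, not stated by the record.
start_bp = 938171
end_bp = 939178

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # prints: 1008 335
```

Under these assumptions the computed values agree with the reported Gene Length (1008 bp) and Protein Length (335 aa).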


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTCAA ATATAAGGCT AAAAGGAAGG GGAGTTAACA GAAAAATTAC GAGTAAGGTA 
ATGCTATCGC CATTAGCAGG AGTTACGGAT AACATTTTTA GACGACTTGT ACGTAAATGG
GCTCCAAACT CTTTACTTTT TACAGAAATG ATAAATGCCA CAAGTCTTAA AAAAGGATAT
GGCACACAAA AAATCAATCA AATAGATTTA GAAGAAGGTC CAATTGGAGT ACAAATATTT
GATAATAGGC CATATGCTGT TTCTGAAGCT GCAAAACAAG CTGAGGACTC TGGAGCTTTC
TTAATCGATA TAAATATGGG ATGTCCAGTA AAAAAAATTG CAAAGAAAGG TGGAGGCAGT
GCCTTAATTA AAGACCGAAA ACTTGCTATA GAATTAGTCA AAAATGTTGT AAAAGCTGTT
AGGGTTCCTG TAACAGTAAA AACACGACTC GGATGGGATA GTAAAGAAGA AAATATAGAG
GATTTCTTAT TTAAACTTCA AGATGCGGGA GCAACCATGA TCACACTTCA TGGAAGAACT
AGAAAACAGG GTTTTTCAGG CAAGTCAGAT TGGGAAATGA TCGGGAGACT TAAAAAGTTG
TTGGAAATTC CAGTAATTGC TAATGGAGAT ATCAAAAATC CAGATGACGC TCTTAATTGT
TTGAAAAAAA CAAAAGCTGA TGGTGTAATG ATTGGACGAG GAATTTTAGG ATCCCCATGG
AAAATAGGAG AAATAGATTA TGCTCTTAGA GAAAATAAAA ATTTTAAAGA ACCAAACACA
GAAGAAAAAC TATATTTAAT TATTGAGCAT CTTGATGAAT TAATAAAAGA AAAAGGAGAT
CACGGTTTGC TAATCGCAAG GAAACATATC TCATGGACAT GCAAAGACTT TAAAGGGGCA
TCAAATTTGA GAAATAACTT GGTTAGAGCT GTTGATAAAA ATGAAGTTAA AAATTTAATA
AATAAAATGA TTCAAACTTT GAATAATGAA AAAAATAGAT TAGCTTAA
 
Protein sequence
MSSNIRLKGR GVNRKITSKV MLSPLAGVTD NIFRRLVRKW APNSLLFTEM INATSLKKGY 
GTQKINQIDL EEGPIGVQIF DNRPYAVSEA AKQAEDSGAF LIDINMGCPV KKIAKKGGGS
ALIKDRKLAI ELVKNVVKAV RVPVTVKTRL GWDSKEENIE DFLFKLQDAG ATMITLHGRT
RKQGFSGKSD WEMIGRLKKL LEIPVIANGD IKNPDDALNC LKKTKADGVM IGRGILGSPW
KIGEIDYALR ENKNFKEPNT EEKLYLIIEH LDELIKEKGD HGLLIARKHI SWTCKDFKGA
SNLRNNLVRA VDKNEVKNLI NKMIQTLNNE KNRLA
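
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method the database uses: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon table is the standard amino acid assignment shared by translation table 11, and alternative start codons (which table 11 permits) are not handled because this gene begins with ATG.

```python
from itertools import product

# Amino acid assignments for translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGTCTTCAA ATATAAGGCT AAAAGGAAGG"
print(translate(first_blocks))  # prints: MSSNIRLKGR
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the reported GC content and the complete protein.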