Gene P9303_21571 details


Gene Information

Locus tag: P9303_21571
Symbol:
ID: 4777449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1916505
End bp: 1917515
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 59%
IMG OID: 640087665
Product: putative nitrogen regulation protein NifR3 family protein
Protein accession: YP_001018157
Protein GI: 124023850
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
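
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon. Both are assumptions about the database's conventions, not statements taken from the record. The variable and function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1916505
end_bp = 1917515
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS holds one codon per residue plus one stop codon.
codons = gene_length_bp // 3


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


print(span_bp, codons, record_is_consistent())
```

Under these assumptions, 1011 bp corresponds to 337 codons, which is the 336 listed residues plus one stop codon.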


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0855264
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCTGAAC TCAGCCTGCC CGGCAGGGGA ACAACAAGGG CACTGCGCTG CAGAGTGCTG 
CAATCGCCCC TTGCAGGCGT GAGTGATCAG ATCTTCCGAA GCCTGGTTCG GCGCTGGGCA
CCAGATGCAT TGTTATTCAC GGAAATGGTC AATGCCACCA GCCTGGAGCT GGGCCACGGC
CTACAGAAAA TCGACGAACT CGCCAATGAA GCCGGGCCCA TTGGCGTGCA ATTGTTCGAT
CACCGCCCAG AAGCGATGGC CGATGCTGCA CAGCGAGCAG AGGCTGCTGG TGCCTTCCTG
ATCGACATCA ACATGGGCTG TCCTGTGCGC AAGATTGCAC GCAAGGGCGG TGGCTCTGGT
CTGATCCGTG ACCCACAACT AGCGGCAAAA ATCGTGAGCA CTGTTGCTGC AGCAGTCAAA
ATCCCAGTCA CCGTGAAGAC AAGGCTGGGC TGGTGTGGCA GCGATGCAAG CCCTGTGCGC
TGGTGTCAAT GGCTCGAGCA GGCCGGCGCC CAGATGTTGA CATTGCATGC TCGAACTCGA
GAGCAAGGCT TCAAAGGCTC AGCTGATTGG CTCGCCATCG CTGCCGTCAA AAGCGCACTG
CAAATCCCCG TGATTGCCAA TGGCGATGTC AAGAGCGACA TGGATGCCAA GCGCTGCCTG
GCGATCACTG GAGCCGATGG CGTGATGGTG GGCAGAGGCT CGATGGGAGC GCCATGGCTT
GTGGGTCAAA TCGATGCAGC ACTATCCGGC ATACCCGTGC CTGCAACGCC TGGAGCGGCA
GAGCGACTCA CCATTGCTCG TGAACAACTT GAAGCGCTGG TACAAGCAAA GGGGGAACAT
GGACTCTTGA TTGCCCGTAA ACACATGGGA TGGACTTGTA GCGGCTTCGT CGGCGCGTCG
AAACTGCGCC ATGCCCTGAT GCGTGCACCA ACGCCAACGG ATGCCATATC ACTGTTAGAG
CAAGCCAGCG CAGAACTCAT AACGGCATGG CCTGAAGCGA CCAACGCCTA A
 
Protein sequence
MPELSLPGRG TTRALRCRVL QSPLAGVSDQ IFRSLVRRWA PDALLFTEMV NATSLELGHG 
LQKIDELANE AGPIGVQLFD HRPEAMADAA QRAEAAGAFL IDINMGCPVR KIARKGGGSG
LIRDPQLAAK IVSTVAAAVK IPVTVKTRLG WCGSDASPVR WCQWLEQAGA QMLTLHARTR
EQGFKGSADW LAIAAVKSAL QIPVIANGDV KSDMDAKRCL AITGADGVMV GRGSMGAPWL
VGQIDAALSG IPVPATPGAA ERLTIAREQL EALVQAKGEH GLLIARKHMG WTCSGFVGAS
KLRHALMRAP TPTDAISLLE QASAELITAW PEATNA
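
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG is one of the permitted initiation codons and is read as methionine at the start position. The following is a minimal sketch of that translation, not the tool used to produce this record. The names `translate_cds` and `gc_percent` are hypothetical helpers. The codon table is built from the standard amino-acid ordering over the bases T, C, A, G, and the start-codon set is the one listed for table 11 in the NCBI genetic codes.

```python
from itertools import product

# Codon order follows the NCBI layout: first base varies slowest, T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace used to group bases in tens."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An alternative start codon is still read as methionine at position 1.
    if codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop the terminal stop codon symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence above.
print(translate_cds("GTGCCTGAAC TCAGCCTGCC CGGCAGGGGA"))
```

Passing the full gene sequence to `translate_cds` should allow a direct comparison against the listed protein sequence, and `gc_percent` can be compared against the GC content field.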