Gene PCC8801_2368 details


Gene Information

Locus tag: PCC8801_2368
Symbol:
ID: 7104638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2437238
End bp: 2438248
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 38%
IMG OID: 643475409
Product: TIM-barrel protein, nifR3 family
Protein accession: YP_002372537
Protein GI: 218247166
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
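
The start, end, gene length, and protein length fields in this record are mutually consistent if the coordinates are read as 1-based and inclusive. The short Python sketch below reproduces that arithmetic. The function names are illustrative and not part of the database record, and the inclusive-coordinate convention is an assumption that the record's own numbers happen to satisfy.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2437238, 2438248)
print(length)                  # 1011
print(protein_length(length))  # 336
```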


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTCAGT TAAAAGAAAA ATTATCAACT CCCTTAAAAA TTGCCTCGGT GGAGCTAAAA 
AGTCGTGTTT TTCAGTCCCC TTTATCGGGG GTAACTGACT TAGTATTTCG TCGTTTAGTT
AGACGATATG CCCCTCAATC GATGATGTAT ACTGAGATGG TAAGTGCCAC AGAAATTCAT
CATCTTAAAG CAATTCCTAG ACTGATGGAA ATTGCACCTG ATGAAGATCC AATTAGTATT
CAACTATTTG ATTGTCGTCC TGATTTTATG GCAGAAGCAG CAGAAAAAGC AGTCGCTGAA
GGAGCCAAAA CTATCGATAT TAATATGGGT TGTCCTGTTA ATAAAATCAC CAAAAAAGGC
GGCGGTTCGT CTTTGCTTCG TCAACCAGAA ATTGCCCAAG CTATTGTTAA AGAAGTGGTT
AAGACTGTCG ATATTCCTGT CACTGTAAAA ACCCGTATTG GTTGGGATGA TCAAGAGATT
ACAATTCTTG ATTTTGCAAA AAAAATGGAA GATGCAGGAG CCCAAATGCT AACTATTCAT
GGCAGAACGC GCGCTCAGGG TTATAATGGA AAAGCTCGAT GGGAATGGAT TGCCAAAGTT
AAAGAAATTG TCAGTATTCC CGTGATTGCT AATGGAGATA TCTTTTCAGT AGACGCGGCG
ATTAAGTGTT TAGAAGAAAC TAATGCAGAT GGCGTAATGT GTTCGCGGGG GACGTTAGGG
TATCCCTTTT TAGTTGGAGA AATTGACTAT TTTTTACAAA CAGGAACTCG ACGGGCTATT
GTCACTCCTA GTCAACGCTT AGAATGTGCT AAGGAACATT TTAATAATTT GTGGGAATAT
AAAGGGATTA AAGGAATTTA TCAGTCAAGA AAACATTTAA GTTGGTATTG TAAAGGATTT
TCTGGTGCAT CAGAATTACG CGATCGCGTC TCCCGTATTG AAACCCTTGA AGAAGGAAAT
CAATTATTAG ATCATGCCAT AGAATTATGT AGAAAAACTG AAATTAATTA A
 
Protein sequence
MFQLKEKLST PLKIASVELK SRVFQSPLSG VTDLVFRRLV RRYAPQSMMY TEMVSATEIH 
HLKAIPRLME IAPDEDPISI QLFDCRPDFM AEAAEKAVAE GAKTIDINMG CPVNKITKKG
GGSSLLRQPE IAQAIVKEVV KTVDIPVTVK TRIGWDDQEI TILDFAKKME DAGAQMLTIH
GRTRAQGYNG KARWEWIAKV KEIVSIPVIA NGDIFSVDAA IKCLEETNAD GVMCSRGTLG
YPFLVGEIDY FLQTGTRRAI VTPSQRLECA KEHFNNLWEY KGIKGIYQSR KHLSWYCKGF
SGASELRDRV SRIETLEEGN QLLDHAIELC RKTEIN
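
The gene and protein sequences above are related through translation table 11, whose amino acid assignments match the standard genetic code. As a rough illustration, the Python sketch below strips the 10-base display grouping, computes a GC percentage, and translates codon by codon. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and only the first 60 bases of the gene sequence are used as sample input; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table built in T, C, A, G order. The amino acid assignments of
# translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, in the grouped form shown in the record.
sample = "ATGTTTCAGT TAAAAGAAAA ATTATCAACT CCCTTAAAAA TTGCCTCGGT GGAGCTAAAA"
print(translate(sample))  # MFQLKEKLSTPLKIASVELK
print(round(gc_percent(sample), 1))
```

The translated sample matches the first 20 residues of the protein sequence listed in the record.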