Gene PCC7424_2821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_2821 
Symbol 
ID7111090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp3134205 
End bp3135890 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content45% 
IMG OID643481067 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002378096 
Protein GI218439767 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.00495915 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAGATA ATTATAGAAG TCGCATCATT ACTGAAGGGA GTCAACGCAC TCCTAACCGG 
GCTATGCTTC GCGCCGTTGG ATTTGGAGAT GATGACTTTA CTAAACCCAT AGTGGGCATT
GCTAACGGAT ATAGCACGAT TACCCCTTGT AATATGGGGA TTAATGACCT GGCACTAAGG
GCCGAAGACG GACTGAAAAA AGCGGGAGCT ATGCCCCAAA TTTTTGGCAC AATTACCATC
AGTGATGGGA TTTCTATGGG AACAGAAGGG ATGAAATATT CTCTTGTCTC TAGGGATGTA
ATCGCCGACT CCATCGAAAC TGTCTGTAAT GGTCAAAGTA TGGATGGAGT TATAGCGATC
GGGGGTTGTG ATAAAAATAT GCCCGGAGCT ATGATTGCTA TGGCGAGAAT GAATATTCCG
GCGATTTTTG TTTATGGAGG CACAATTAAA CCCGGACACT ACAATGGAGA AGATTTAACT
GTTGTGAGTG CTTTTGAAGC TGTCGGACAA TATAGCGCGG GTAAAATAGA CGATGCCCAG
TTATCAGGAA TTGAACGCAA TGCCTGTCCG GGGGCGGGTT CTTGTGGGGG TATGTTTACC
GCTAATACCA TGTCTTCTGC GTTTGAAGTG ATGGGGATGA GTCTTCCCTA TTCCTCTACA
ATGGCGGCAG AAGATGCAGA AAAAGCCGAT AGTACCGAAA AATCCGCTTT TGTTTTAGTG
GAGGCGATTA AAAAACAAAT TCTCCCCCGT CAGATTTTAA CTCGCAAAGC TTTTGAGAAT
GCTATCTCGG TGATTATGGC GGTTGGCGGT TCGACCAATT CGGTTTTACA TTTATTGGCG
ATCGCTCATA CCATAGGGGT AGAACTGAGT TTAGATGACT TTGAAACCAT TAGAAAGCGA
GTGCCGGTCA TTTGTGACCT GAAACCCTCT GGACGCTACG TTACGGTTAA TTTACATCAA
GCTGGTGGTA TTCCTCAAGT GATGAAAATG CTGTTAGTCC ATGACTTATT ACATGGGGAT
GCTTTAACTA TTACCGGACA AACCATTGCA GAAGTTTTAA AAGATGTCCC CGATGAACCC
CCTCTAGGTC AAGATGTGAT TCGTCCTTGG GATAACCCCG TTTACCAAGA AGGACACCTA
GCTATTTTAA AAGGAAACTT AGCTACAGAG GGAGCAGTAG CTAAAATTAG TGGGGTCAAA
GTTCCTAAAA TTACTGGACC TGCCCGGGTC TTTGAATCAG AAGAAAGTTG TTTAGAAGCC
ATTCTTGCCA AGAGAATTAA GCCGGGAGAT GTGATTATCG TCCGTTATGA AGGGCCAAAA
GGAGGCCCCG GAATGCGGGA AATGTTAGCG CCTACCTCTG CTATTATTGG GGCAGGATTA
GGGGATACAG TGGGATTAAT TACCGATGGG CGTTTTTCTG GAGGAACTTA CGGCATGGTC
GTTGGTCATG TTGCTCCTGA AGCCGCAGTA GGCGGTACTA TTGCCCTCGT AGAAGAAGGC
GATACGATCA CCATTGATGC TCATCAACGC TTATTACAGC TTAATGTGTC TGATGAAGAA
TTAGCCCGTC GTCGTGCTAA GTGGCAACCC CCTCAACCTC GGTATACAAC AGGGGTATTG
GCTAAGTATG CTAAATTAGT CTCTTCGAGT AGTATCGGGG CAGTGACGGA TAAAGATTTA
TTTTAA
 
Protein sequence
MSDNYRSRII TEGSQRTPNR AMLRAVGFGD DDFTKPIVGI ANGYSTITPC NMGINDLALR 
AEDGLKKAGA MPQIFGTITI SDGISMGTEG MKYSLVSRDV IADSIETVCN GQSMDGVIAI
GGCDKNMPGA MIAMARMNIP AIFVYGGTIK PGHYNGEDLT VVSAFEAVGQ YSAGKIDDAQ
LSGIERNACP GAGSCGGMFT ANTMSSAFEV MGMSLPYSST MAAEDAEKAD STEKSAFVLV
EAIKKQILPR QILTRKAFEN AISVIMAVGG STNSVLHLLA IAHTIGVELS LDDFETIRKR
VPVICDLKPS GRYVTVNLHQ AGGIPQVMKM LLVHDLLHGD ALTITGQTIA EVLKDVPDEP
PLGQDVIRPW DNPVYQEGHL AILKGNLATE GAVAKISGVK VPKITGPARV FESEESCLEA
ILAKRIKPGD VIIVRYEGPK GGPGMREMLA PTSAIIGAGL GDTVGLITDG RFSGGTYGMV
VGHVAPEAAV GGTIALVEEG DTITIDAHQR LLQLNVSDEE LARRRAKWQP PQPRYTTGVL
AKYAKLVSSS SIGAVTDKDL F