Gene PCC8801_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2999 
Symbol 
ID7104491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3106019 
End bp3107158 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content49% 
IMG OID643476028 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_002373142 
Protein GI218247771 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGGC GAAACCTCGT TAAAGCAACC TCTCAGGGTG CGATCGCAGC AACGGTTGCC 
GGGATAGCAA TAGGATGCAG CAAGTCCCAG AACCAAGCAA CCTCTAACAA CACAGAACTG
CCAAAAATTA CTTGGCAGAT GGCCACTAGC TGGCCGCTGT CCCTCGATAC AATCTTTGGT
GGCGCGACGG TTTTTGCCGA AAGAGTGGCT CAGATGAGCG GTGGACGCTT TAAAATCTCC
CCCAAACCCG CCGGAGACTT AGCTCCCCCC CTAGAAGTCT TAAACGTGGT TAAGCAAGGG
GCTGTTCCCT GCGGTCACAC GGCTGCTTAT TATTACATCG GACAAAATCC GGCCGCTGCT
TTCGGTACGG CTGTCCCTTT TGGACTGACG GCTCAACAAC AAAATACTTG GCTCTATGAA
GGGGAAGGCT TAAAACTATT GCAGGAACTC TACGCCAGCC AATTCGGAGT CATCCAATTT
CCGGCGGGTA GCACGGGCAC ACAGATGGGG GGGTGGTTTC GCAAGGAAGT CTCAACCATT
AACGACTTAA AGGGGCTAAA AATGAGGATT CCGGGCTTGG GGGGTCAGGT AATGAGTAAG
TTGGGGGTGC TGGTGCAAAA TCTCCCAGGA GGGGAAATTT TCCAGGCTCT ACAAACGGGT
GCTATTGATG CAGCCGAATG GGTTGGCCCC TACGATGATG AAAAATTGGG ACTTAATAAA
GTCGCTCAAT ACTATTACTA TCCGGGTTGG TGGGAACCGG GTCCGACTCT GGAAGTGCAA
ATTAACCTCA ATGCCTGGAA AAAGTTGCCC GTTGAATATC AACAAATGAT CCAGACCGCC
GCCTTTGAAG CTAATCAGAT CATGCTGGCT CGTTACGAAG CTCGCAACTA TGAGGCATTG
CAAAGATTGC TACAAAGTGG AACCCAACTG CGCCCCTACA GTGATGAAAT CTTAAATGCA
GCTAAGACGA GTGCTTTTGA ATTGTATGAC GAATTTGCCG CAAAAAATGC TGATTTTAAA
GCGATTTTTG AAAACTGGCA GAAGTTCCGC GATGGGGTTT TCACTTGGAG CAATCTCAAT
CAAGGCAGTT TTGAACGGTT TGTTTACAAA ACTCTCGACA CGCCATCCCA AGGCTCATAA
 
Protein sequence
MKRRNLVKAT SQGAIAATVA GIAIGCSKSQ NQATSNNTEL PKITWQMATS WPLSLDTIFG 
GATVFAERVA QMSGGRFKIS PKPAGDLAPP LEVLNVVKQG AVPCGHTAAY YYIGQNPAAA
FGTAVPFGLT AQQQNTWLYE GEGLKLLQEL YASQFGVIQF PAGSTGTQMG GWFRKEVSTI
NDLKGLKMRI PGLGGQVMSK LGVLVQNLPG GEIFQALQTG AIDAAEWVGP YDDEKLGLNK
VAQYYYYPGW WEPGPTLEVQ INLNAWKKLP VEYQQMIQTA AFEANQIMLA RYEARNYEAL
QRLLQSGTQL RPYSDEILNA AKTSAFELYD EFAAKNADFK AIFENWQKFR DGVFTWSNLN
QGSFERFVYK TLDTPSQGS