Gene PCC8801_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2004 
Symbol 
ID7104774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2078870 
End bp2080780 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content48% 
IMG OID643475065 
Producthypothetical protein 
Protein accessionYP_002372197 
Protein GI218246826 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAAT CTACTGAAAC GCCAACTGCT ATCAAAGACT TGCCGATCAA GTTGAAAATT 
CCCGTTCAAC TGAGGAAATT AAACGAATTG CCTTGGCAAG CTTGGGCAAT TGCTCTGATT
GTTGCCTCTG GAACAGTGGG ATTTACTGCC ACGTCAATGT TACTCAGCTT GCCGAAATCG
GCTCAATGTA CCAGAGTATT CTGGCCGATC GCCTCAGCGT CTACCCGTAT CTACTGTGCC
CAACTTGAAG CAGAACAGGG AACCGTAGAC AGTTTGCTCA AAGCCATTAA CCTCGTAGAA
GCCTTACCCG CGAACCATCC CCTACGGGAC GATATTAACC AAAATGTCGA AGAATGGGCA
GTTGCCATCC TCGATATTGC CGAAAAAGAA TTTCAAGCCG GAAAACTCGA CGAAGCCATC
AGTATTGCCC GTAAAATCCC CAGTCATGTT CAAGTTTACA ACGTGGTAGA CGAACGGATC
GAAGCCTGGC GATCGCTGTG GCAAGAAGGA GAAGCGATTT TTGCAGAAGT CGAAGAACAT
CTGCGCAACG CTGAATGGAA CCAAGCGTTC CGCACTGCCG TTAAACTGTT GAACCTAGAC
AATGAATACT GGTCTACCAT CAAATACGAT GAAACCATTA AAGAAATTCA ATTAGCCCAA
GAAGAAAGCA GTAAACTCGA TAAAGCCTAT AGCATCCTAC GACGGGGTGG CGTGGATAAT
TGGCTACAGG CTATCACAGA AGCTCAACAG GTAGCAGCCG ATAGCTATGC TTATCGAGAA
GCCCAAAAAC TCATCGAGGA GGCTAAAGAA AAGTTAGTCA GCCACATCGA AGGGTTAATG
GATGACAACC GTTGGGAAAG CGTCCTAGAA GTCGTTGATC GTCTGCCAGA AAGTTTAGCC
CTAACCGAAG AGATCACCGA CTGGAAAGCC TTGGCTAGTG CAGGACTCGA AGCCCAGAAC
GGTACAGTAG AAAGCTTACA AACGGCGATC GCCTCAGCCC AAGAAATTGA CGCAGATCGC
CCACTTTATC AAGACGCGCA AGATTTAATT AGCCGTTGGC AACTGGAAAT CGAAGGTGTC
ACCCATCTGC AAAAAGCCCG CGATTTAGCC CAAGGAGGCA CGATTAATGA CCTCAATGGG
GCGATCGCCT CCGCAGAATT AGTCACCTCT GTAAACCCCC GTTATGGCGA AGCTCAAGGA
GAAATTAGAG ACTGGAGAAG ACGGATTCAA ATCTCGGAAG ATCAACCTCT GTTAGATCAA
GCCAGGGATC TGTCCCGCGA TGGCAGTATT GCCTCCTTAC AAGAAGCGAT CGCCAAAGCT
AGTCTCATTG GTGCTAATCG AGCACTCTCT TCGGAAGCAC AACAAGAGAT TGAAAAATGG
CGCAACAGTA TTCAACGTCA AGAAGATCAA CCCCTTTTGG ATCAAGCGAT CGCCCTAGGG
GATGTCAAAG ACTATAACGC TGCCATTAGT ACGGCTGAAC GAATTGGCCG GGGACGAGTC
CTCTATCAAG AAGCCCAAAA TAACATCCGA GGATGGCGAC GGGAAATACG CGCCCAAAAG
AATCTCCAAG AAGCCTATCT CATCGCCCAA GGGAGAACCC CTCAAGCCTT AGCCTCAGCG
ATTAGTTTAG TCCAAAAAAT CCCTCGTTCG ACGGACGTGA GCCTTGAAAG CCGACAAGTG
CTTAACCGTT GGAGTGCCGA ACTGTTATCG ATGGCTGAAG ATCAAGCGCG ACGATCCCTC
TTAGAAGAAG CGATTAAATT AGCTCGCATG GTCCCTTCTG ACAGTGAGGT TTACAACTCG
GCTCAACTTC AGATTCAAGC TTGGAAAGGC ATCCTTCAAC CGCCTACACC TGCGGTCATT
AATGAGACGA ATCAACCTCT ATTACCCATC AATTCACCCC AAAATCAATG A
 
Protein sequence
MNQSTETPTA IKDLPIKLKI PVQLRKLNEL PWQAWAIALI VASGTVGFTA TSMLLSLPKS 
AQCTRVFWPI ASASTRIYCA QLEAEQGTVD SLLKAINLVE ALPANHPLRD DINQNVEEWA
VAILDIAEKE FQAGKLDEAI SIARKIPSHV QVYNVVDERI EAWRSLWQEG EAIFAEVEEH
LRNAEWNQAF RTAVKLLNLD NEYWSTIKYD ETIKEIQLAQ EESSKLDKAY SILRRGGVDN
WLQAITEAQQ VAADSYAYRE AQKLIEEAKE KLVSHIEGLM DDNRWESVLE VVDRLPESLA
LTEEITDWKA LASAGLEAQN GTVESLQTAI ASAQEIDADR PLYQDAQDLI SRWQLEIEGV
THLQKARDLA QGGTINDLNG AIASAELVTS VNPRYGEAQG EIRDWRRRIQ ISEDQPLLDQ
ARDLSRDGSI ASLQEAIAKA SLIGANRALS SEAQQEIEKW RNSIQRQEDQ PLLDQAIALG
DVKDYNAAIS TAERIGRGRV LYQEAQNNIR GWRREIRAQK NLQEAYLIAQ GRTPQALASA
ISLVQKIPRS TDVSLESRQV LNRWSAELLS MAEDQARRSL LEEAIKLARM VPSDSEVYNS
AQLQIQAWKG ILQPPTPAVI NETNQPLLPI NSPQNQ