Gene PCC8801_2701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2701 
Symbol 
ID7105038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2788388 
End bp2789593 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content38% 
IMG OID643475739 
Productpentapeptide repeat protein 
Protein accessionYP_002372858 
Protein GI218247487 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTGCTTCAAT TCTTCTCCTA ACTCCTCTGT TACTAACCTC GGTTGTTAGA 
GCAGAAAATC CGAGTTCAGT AGAACGATTA TTAACCACAA GAGAATGTGT TGGTTGCGAC
TTAGAAAATG CTAATCTCAA AGGCATGAAT TTAGAAGGAG CCAATCTCGA AAAAGCAAAT
TTAAAAAATG CCAATTTGGA GAGAGCTAAC CTGAAGAATG CTAACTTAAA ACAAGCCATT
TTGCAAGATG CGAAATTAAC CGAAGCTCAA CTCGAAGGCG TTATGCTTGA TGGGGTTAAT
TTTATTAATG CCAAGCTAAA AGGGGTTAAT TTAAGCGGAG TGAACCTTAA AGGGGTTAAT
TTCGTCAATG CGGAGATGGA TGGCATTATC TTAACGAATG CTAACCTAGA AGGAGCCCAG
ATGAGGGGTG TCACCCTCGA AGGAGCCAAC CTAGACGGAG CTAACTTACA GGGAGTTGAT
TTAACGGTTC ATGACGAAGA ACGAGCCAAT TTAACGGGTG CAAGTCTAAA AAATGCTGAC
TTGTCAGGGG GTTTTCTGCG GGGTATCAGA CTAAAAAACG CTAACCTTGA AGGCGCAAAT
CTTTCTAAAA CCGACTTTAG CCGCGATATT CCTAATAATA CCACCGCTAA AGGAGCTCTT
AATGTCGCTA CTACCCCCAT TCCTTTAATT TTTCCTGGGG CAATTTTGGG GGTTATTGGA
GACGTAGCTA TTAATGAAGC TTCGGCTCTC AATGCGGATG TGAGTTATGC CAATTTAGAA
GGAGCTAACT TACAAGATTC TAACCTAGAA GACATTAATT TTGAGAGTTC TAATCTAAAG
AATGCTAACT TACAAAATGC GAATTTAAAC AATGCTTATT TAGTCAATAC TAACTTGACT
AATGCTAATT TAAGTTCAGC TAATTTAACT AACATTAATA TGCAAGGAGT GAACTTAAGT
TATGCTAACT TAATGGGAGC TAATTTAGAC GGTTCTTATT TAGTTAATGC TAATTTGAGC
CATGGTAACC TTGAATCAGC GCACTTAACA AGTATTAATA TGAGTGGTGC TCAGTTAAGT
AATGCTAACT TAAGCGAAGC TAAATTAACC GATTCCAACT TGAGTAATTC TAACTTGTGT
AGTGCCACGA TGCCTGATGG TTCGATTTCT CAAATTGGAT GTACTGCTGT TAATCTTGAC
AAGTAA
 
Protein sequence
MKKIASILLL TPLLLTSVVR AENPSSVERL LTTRECVGCD LENANLKGMN LEGANLEKAN 
LKNANLERAN LKNANLKQAI LQDAKLTEAQ LEGVMLDGVN FINAKLKGVN LSGVNLKGVN
FVNAEMDGII LTNANLEGAQ MRGVTLEGAN LDGANLQGVD LTVHDEERAN LTGASLKNAD
LSGGFLRGIR LKNANLEGAN LSKTDFSRDI PNNTTAKGAL NVATTPIPLI FPGAILGVIG
DVAINEASAL NADVSYANLE GANLQDSNLE DINFESSNLK NANLQNANLN NAYLVNTNLT
NANLSSANLT NINMQGVNLS YANLMGANLD GSYLVNANLS HGNLESAHLT SINMSGAQLS
NANLSEAKLT DSNLSNSNLC SATMPDGSIS QIGCTAVNLD K