Gene PCC8801_2866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2866 
Symbol 
ID7104387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2956140 
End bp2957549 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content40% 
IMG OID643475902 
Productanthranilate synthase component I-like protein 
Protein accessionYP_002373021 
Protein GI218247650 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR01824] aminodeoxychorismate synthase, component I, clade 2 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATA TCTTAACAGG TTGGTATTGG CGATCACGTC CTTTGAACCA TCAAACAGGT 
TCACAAATAT TTGAGCGTTT ATTTAACGAT AATCAAACCA TTGCAACCCT CTTAGAAAGT
CCCTTTCCTA CCCCCACTGA CTACCCTTAT CTTTCTCGCT ATTCTATTTG TGCAGGCCAA
CCTCGTATTA TTAACAAAAA ACCTCAAGTA TGGACTCCTA AGCTCGGAGA AGTTTTCCCC
TTTTTAAAGA GTTTATTAGA AAGAAATTTT TGTCTTTCTG AAAGCCCTAA TCATCTCCCT
TTTATTGGAG GTTGGTTAGG ATGGCTAGGG TATGATTTGG CTTGGGAAAT TGAACAATTA
CCCCATACAA ATCGAGATAT TTTACCCTTT CCTGTTGCCT ATTGGTATGA ACCCGAATCC
TTTGCTATTC TCGATCATGT TGAACAAAGG CTATGGTTAG CCAGTACCAC CCTTGATCAA
CTCGATGAAT TAGAACAAAA ATTAGAGCAA GATCTGCCTT TAATTCCTGA CCTTTTTACC
CCTGCTTCTG GGCTTTCTTT CTATACGACT CAACAAGAAT ACGAAAATGC TGTCCGTCAA
GCCAAAAAAT ATATCGAAGC AGGAGATATT TTTCAAGCCA ATCTTTCCTT GAGATTTCAT
TCAACTACCG TTGCTGATAG TTGGACTATC TATCGAAATT TACAAAGAAT TAATCCTTCT
CCTTTTGCGA GCTATTGGCG AACACCTTGG GGAGATGTTA TTAGTTGTTC TCCTGAAAGA
TTAATTCAAT TACAAGGAAA TCAAGCCCAA ACTAGGCCAA TAGCAGGAAC ACGACCCCGT
GGTAAAACAC CCGAACTCGA ACAACACTTA TTAGCCGAAT TAACCCGTGA TATTAAAGAA
CAAGCCGAAC ATATTATGTT AGTTGATTTA GAACGAAACG ATTTAGGACG AGTGTGTCAG
TGGGGATCGG TTTATGTGGA TGAATTATTA ACCATAGAAC GCTATAGTCA TGTGATTCAT
TTAGTTAGTA ATGTTAGGGG AACTTTAGCC CGCGATCGCA ATGTCATTGA TCTAATTAAA
GCCCTTTTTC CAGGGGGAAC CATCACCGGA TGTCCTAAAG TCCGTTGTCT AGAAATTATT
GAAGAATTAG AACCTTTGCG TCGCAATCTT TTTTATGGTT CCTGTGGCTA TTTAGATCAA
CGGGGAAATC TGGATTTAAA CATACTCATT CGGACACTTT TATCAACGTC TTTATCGAAT
GGGTTAAAGG GCATTTGGGG ACAAGTAGGT GCGGGAATTG TCGCCGATAG TGACCCCGAA
AAAGAATGGT ATGAGTCCCT ACAAAAAGCT CAAGCCCAGT TAGCGGCTTT GAATCAAGTC
AGAAGTCAGA AGTCAGAAGT CAGAAGTTAA
 
Protein sequence
MTDILTGWYW RSRPLNHQTG SQIFERLFND NQTIATLLES PFPTPTDYPY LSRYSICAGQ 
PRIINKKPQV WTPKLGEVFP FLKSLLERNF CLSESPNHLP FIGGWLGWLG YDLAWEIEQL
PHTNRDILPF PVAYWYEPES FAILDHVEQR LWLASTTLDQ LDELEQKLEQ DLPLIPDLFT
PASGLSFYTT QQEYENAVRQ AKKYIEAGDI FQANLSLRFH STTVADSWTI YRNLQRINPS
PFASYWRTPW GDVISCSPER LIQLQGNQAQ TRPIAGTRPR GKTPELEQHL LAELTRDIKE
QAEHIMLVDL ERNDLGRVCQ WGSVYVDELL TIERYSHVIH LVSNVRGTLA RDRNVIDLIK
ALFPGGTITG CPKVRCLEII EELEPLRRNL FYGSCGYLDQ RGNLDLNILI RTLLSTSLSN
GLKGIWGQVG AGIVADSDPE KEWYESLQKA QAQLAALNQV RSQKSEVRS