Gene Cyan8802_3402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3402 
Symbol 
ID8392738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3471865 
End bp3473070 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content38% 
IMG OID644981339 
Productpentapeptide repeat protein 
Protein accessionYP_003139065 
Protein GI257061177 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TTGCTTCAAT TCTTCTCCTA ACTCCTCTGT TACTAACCTC GGTTGTTAGA 
GCAGAAAATC CGAGTTCAGT AGAACGATTA TTAACCACAA GAGAATGTGT TGGTTGCGAC
TTAGAAAATG CTAATCTCAA AGGCATGAAT TTAGAAGGAG CCAATCTCGA AAAAGCAAAT
TTAAAAAATG CCAATTTGGA GAGAGCTAAC CTGAAGAATG CTAACTTAAA ACAAGCCATT
TTGCAAGATG CGAAATTAAC CGAAGCTCAA CTCGAAGGCG TTATGCTTGA TGGGGTTAAT
TTTATTAATG CCAAGCTAAA AGGGGTTAAT TTAAGCGGAG TGAACCTTAA AGGGGTTAAT
TTCGTCAATG CGGAGATGGA TGGCATTATC TTAACGAATG CTAACCTAGA AGGAGCCCAG
ATGAGGGGTG TCACCCTCGA AGGAGCCAAC CTAGACGGAG CTAACTTGCA GGGAGTTGAT
TTAACGGTTC ATGACGAAGA ACGAGCCAAT TTAACGGGTG CAAGTCTAAA AAATGCTGAC
TTGTCAGGGG GTTTTCTGCG GGGTATCAGA CTAAAAAACG CTAACCTTGA AGGCGCAAAT
CTTTCTAAAA CCGACTTTAG CCGCGATATT CCTAATAATA CCACCGCTAA AGGAGCTCTT
AATGTCGCTA CTACCCCCAT TCCTTTAATT TTTCCTGGGG CAATTTTGGG GGTTATTGGA
GACGTAGCTA TTAATGAAGC TTCGGCTCTC AATGCGGATG TGAGTTATGC CAATTTAGAA
GGAGCTAACT TACAAGATTC TAACCTAGAA GACATTAATT TTGAGAGTTC TAATCTAAAG
AATGCTAACT TACAAAATGC GAATTTAAAC AATGCTTATT TAGTCAATAC TAACTTGACT
AATGCTAATT TAAGTTCAGC TAATTTAACT AACATTAATA TGCAAGGAGT GAACTTAAGT
TATGCTAACT TAATGGGAGC TAATTTAGAC GGTTCTTATT TAGTTAATGC TAATTTGAGC
CATGGTAACC TTGAATCAGC GCACTTAACA AGTATTAATA TGAGTGGTGC TCAGTTAAGT
AATGCTAACT TAAGCGAAGC TAAATTAACC GATTCCAACT TGAGTAATTC TAACTTGTGT
AGTGCCACGA TGCCTGATGG TTCGATTTCT CAAATTGGAT GTACTGCTGT TAATCTTGAC
AAGTAA
 
Protein sequence
MKKIASILLL TPLLLTSVVR AENPSSVERL LTTRECVGCD LENANLKGMN LEGANLEKAN 
LKNANLERAN LKNANLKQAI LQDAKLTEAQ LEGVMLDGVN FINAKLKGVN LSGVNLKGVN
FVNAEMDGII LTNANLEGAQ MRGVTLEGAN LDGANLQGVD LTVHDEERAN LTGASLKNAD
LSGGFLRGIR LKNANLEGAN LSKTDFSRDI PNNTTAKGAL NVATTPIPLI FPGAILGVIG
DVAINEASAL NADVSYANLE GANLQDSNLE DINFESSNLK NANLQNANLN NAYLVNTNLT
NANLSSANLT NINMQGVNLS YANLMGANLD GSYLVNANLS HGNLESAHLT SINMSGAQLS
NANLSEAKLT DSNLSNSNLC SATMPDGSIS QIGCTAVNLD K