Gene PCC8801_3643 details

Gene Information

Locus tag: PCC8801_3643
Symbol:
ID: 7103325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3800311
End bp: 3801531
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 43%
IMG OID: 643476653
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_002373761
Protein GI: 218248390
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAGG GGTTCGAGTT TGATGTGTTT CTAGCGCATA ATAGTGTGGA TAAACCCCAT 
GTTAGGGAGA TTAGTAACAA ACTAAGGGAA CGAGGGTTAA AACCTTGGCT AGATGAGGAA
CAAATCCCTC CTGGGATGTC ATTTCAGGAT GAAATTCAAA AAGCGATTCC CCTGATTAAA
TCGGCAGCTA TTATTATTGG TACTCAGGGA TTAGGAAAAT GGCAGATCAT GGAACTGCGA
TCGCTTATCA CTAAATTTGT GAATCTAAAA ATTCCTGTTA TTCCTGTTTT GTTGCCAGGG
GTTAATAATA TTCCAGGTGA TTTACTATTC CTACAAGAAC TTAATTGGGT TAAGTTTGAA
CAGATTGATG ATGCTACGGC TTTTTATCGG CTAGAGTGGG GCATTACTCA GGTTAAGCCG
GAGTTACATC CCAAAACTGT ACAATTGACT GCCGAGGAAT GGTTTAACCT TGGCTATAAC
AAGGGTGAAT CAGGAGACAA CCAAGGTGCG ATCGCTGACT TTAATCAAGC CATTAAAATC
AAATCCGACT TGGCAGAAGC GTACTACAAT CGCGGGTTAG CCAAGTCTAA CTTAGGAGAC
TATCAAGGTG CGATCTCTGA CTACAATCAA GCCATTGAAA TCAAACCCGA CTATGCTGCT
GCCTACAACA ATCGTGGATT AACTAAGTAT AACTTAGGAG ACAACCAAGG TGCGATCACA
GACTACACTC AAGCGATTGA AATCAAACCC GACGATGCTG ATGCCTACTA TAATCGCGGG
TTAGCCAAGT ATAACTTAGG AGACAAGCAA GGGGCGATCG CTGACTACAA TCAAGCGATT
AAAATCAAAC CCGACTATGC TACTGCCTAC AACAATCGCG GGAATGCTAA GTATAACTTA
GGAGACAAGC AAGGGGCGAT CGCTGACTAC AATCAAGCGA TTAAAATCAA ACCCGACTAT
ACCCTTGCCT ACATCTGTTG CGGGTTAGCC AAGTCTAACT TAGGAGACAA CCAAGGTGCG
ATCACTGACT ACAATCAAGC GATTAAAATC AAACCCGACT ATGCTGATGC CTACATCTGT
CGCGGGAATG CCAAGAAAAA CTTAGGAGAC AACCAAGGTG CGATCGCTGA CTACAATCAA
GCAGCACAAC TTTACTCGCA GCAAAATAAT ATGGAATGGT ATCTTAAAGC CCTTGAAAAG
ATCAAAAAAC TTGAACAATG A
 
Protein sequence
MSEGFEFDVF LAHNSVDKPH VREISNKLRE RGLKPWLDEE QIPPGMSFQD EIQKAIPLIK 
SAAIIIGTQG LGKWQIMELR SLITKFVNLK IPVIPVLLPG VNNIPGDLLF LQELNWVKFE
QIDDATAFYR LEWGITQVKP ELHPKTVQLT AEEWFNLGYN KGESGDNQGA IADFNQAIKI
KSDLAEAYYN RGLAKSNLGD YQGAISDYNQ AIEIKPDYAA AYNNRGLTKY NLGDNQGAIT
DYTQAIEIKP DDADAYYNRG LAKYNLGDKQ GAIADYNQAI KIKPDYATAY NNRGNAKYNL
GDKQGAIADY NQAIKIKPDY TLAYICCGLA KSNLGDNQGA ITDYNQAIKI KPDYADAYIC
RGNAKKNLGD NQGAIADYNQ AAQLYSQQNN MEWYLKALEK IKKLEQ
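
The numeric fields in this record can be cross-checked against each other and against the sequence blocks. The following is a minimal sketch, not part of the original record: the helper names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon. Both assumptions agree with the values listed above.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, ASSUMING an unspliced CDS."""
    return gene_length_bp // 3 - 1


# First row of the gene sequence as printed in the record (six blocks of ten).
first_row = "ATGTCTGAGG GGTTCGAGTT TGATGTGTTT CTAGCGCATA ATAGTGTGGA TAAACCCCAT"
seq = clean_sequence(first_row)

print(len(seq))                          # 60
print(span_length(3800311, 3801531))     # 1221
print(expected_protein_length(1221))     # 406
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the listed GC content.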