Gene Cyan8802_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2471 
Symbol 
ID8391796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2492829 
End bp2494049 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content43% 
IMG OID644980438 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003138175 
Protein GI257060287 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0400427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.45906 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGG GGTTCGAGTT TGATGTGTTT CTAGCGCATA ATAGTGTGGA TAAACCCCAT 
GTTAGGGAGA TTAGTAACAA ACTAAGGGAA CGAGGGTTAA AACCTTGGCT AGATGAGGAA
CAAATCCCTC CTGGGATGTC ATTTCAGGAT GAAATTCAAA AAGCGATTCC CCTGATTAAA
TCGGCAGCTA TTATTATTGG TACTCAGGGA TTAGGAAAAT GGCAGATCAT GGAACTGCGA
TCGCTTATCA CTAAATTTGT GAATCTAAAA ATTCCTGTTA TTCCTGTTTT GTTGCCAGGG
GTTAATAATA TTCCAGGTGA TTTACTATTC CTACAAGAAC TTAATTGGGT TAAGTTTGAA
CAGATTGATG ATGCTACGGC TTTTTATCGG CTAGAGTGGG GCATTACTCA GGTTAAGCCG
GAGTTACATC CCAAAACTGT ACAATTGACT GCCGAGGAAT GGTTTAACCT TGGCTATAAC
AAGGGTGAAT CAGGAGACAA CCAAGGTGCG ATCGCTGACT TTAATCAAGC CATTAAAATC
AAATCCGACT TGGCAGAAGC GTACTACAAT CGCGGGTTAG CCAAGTCTAA CTTAGGAGAC
TATCAAGGTG CGATCTCTGA CTACAATCAA GCCATTGAAA TCAAACCCGA CTATGCTGCT
GCCTACAACA ATCGTGGATT AACTAAGTAT AACTTAGGAG ACAACCAAGG TGCGATCACA
GACTACACTC AAGCGATTGA AATCAAACCC GACGATGCTG ATGCCTACTA TAATCGCGGG
TTAGCCAAGT ATAACTTAGG AGACAAGCAA GGGGCGATCG CTGACTACAA TCAAGCGATT
AAAATCAAAC CCGACTATGC TACTGCCTAC AACAATCGCG GGAATGCTAA GTATAACTTA
GGAGACAAGC AAGGGGCGAT CGCTGACTAC AATCAAGCGA TTAAAATCAA ACCCGACTAT
ACCCTTGCCT ACATCTGTTG CGGGTTAGCC AAGTCTAACT TAGGAGACAA CCAAGGTGCG
ATCACTGACT ACAATCAAGC GATTAAAATC AAACCCGACT ATGCTGATGC CTACATCTGT
CGCGGGAATG CCAAGAAAAA CTTAGGAGAC AACCAAGGTG CGATCGCTGA CTACAATCAA
GCAGCACAAC TTTACTCGCA GCAAAATAAT ATGGAATGGT ATCTTAAAGC CCTTGAAAAG
ATCAAAAAAC TTGAACAATG A
 
Protein sequence
MSEGFEFDVF LAHNSVDKPH VREISNKLRE RGLKPWLDEE QIPPGMSFQD EIQKAIPLIK 
SAAIIIGTQG LGKWQIMELR SLITKFVNLK IPVIPVLLPG VNNIPGDLLF LQELNWVKFE
QIDDATAFYR LEWGITQVKP ELHPKTVQLT AEEWFNLGYN KGESGDNQGA IADFNQAIKI
KSDLAEAYYN RGLAKSNLGD YQGAISDYNQ AIEIKPDYAA AYNNRGLTKY NLGDNQGAIT
DYTQAIEIKP DDADAYYNRG LAKYNLGDKQ GAIADYNQAI KIKPDYATAY NNRGNAKYNL
GDKQGAIADY NQAIKIKPDY TLAYICCGLA KSNLGDNQGA ITDYNQAIKI KPDYADAYIC
RGNAKKNLGD NQGAIADYNQ AAQLYSQQNN MEWYLKALEK IKKLEQ