Gene CPR_1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1840 
Symbol 
ID4204167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2030616 
End bp2031713 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content27% 
IMG OID642566390 
ProductTPR domain-containing protein 
Protein accessionYP_699154 
Protein GI110803525 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00013398 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTTA ATACATTTGC AAAAGAAAAA TTATCACAAC TTTTATTTTT AGAAATAGAT 
GGAGATGGAT TTGTTAAAAG CTTAGGAAAA GATCCAAAAG AGGTTAATAT AAATGAAGTT
TATATCCCAA TAGATCCAAA ACATTTATCA CAGGATGTGA AAAGTGGATA TAAATTAGAA
TCTTTACCAA TAAACTATTT AGTTGAAGGA ATGTTTTTCG CTTTAGGGGG AGACAAGGAT
TTTAAATTTA ATAAGGAGTA TAAAAAATTA ATTCCACTAA TAGAGGATGC TATTCCATGT
GTTAAAAAGA TAGTAGCTGA TAAGGTTAAA GAAGAAAATA TGGTTGAAGC ATTTATGCTT
TTAAAGGGAT TAAGCGAAAT ATCAGATGAA ACTGAGGTTT ATGAAAATCT TCTTTTAATA
TGTGAATCTT TAAGAGAAAG AAATAAAGCT TATAATGAAA CTCAACTAGA AATTTGTGAT
AAATGTAAAG CAAATAGAAG TGATTTAGCT CTTCCATATC TTTATTCAGC AATAGCTTAC
AATGACATTG GGCAATATGA TAAAGCTTAT GTTGATATAA ATGAATATTT AGCTAAGGGT
GGAGAAAGAA ATGAAATAGT TGAGGTTTTA TATAATGAGA TAAAAGACTC AGCTGATTAT
GAAGAAGGAA AAGAAGATTT AATAGAAGAA CCAGAGGATG CTTTAAAAAG ACTTCTTCCT
TTAGCAGATA AATTCCAAGA TAATGCTATT TTAAGATATT ATATTGCAAC AGCTTATAGA
AGACTTGGAA ATTTTGAAAA GGCAGTTTAT TACTTAAATG AATGTTTATC AATTGATGAT
AGTATAGTTG AAGCAGTAAA TGAAATGGGA ATAAATTATG CATCTTTAGG AATATATGAT
GAAGCTATTA AATATTTAAG AAAAGCCTTT GAAAGCACAA GAGATATAGA GGTTTGTACT
AACTTAATAG TTTGTTATTT AAATGCAGGA AAAATAGAAG AGGCAAAGCA GCATTTAGAT
ATAGCAAAGG CTATAAACAA GGATGATGAA ATAGTAAAAG AAATAGAAAC ATTTATGAAA
AATAATAATA TTAAATAG
 
Protein sequence
MSFNTFAKEK LSQLLFLEID GDGFVKSLGK DPKEVNINEV YIPIDPKHLS QDVKSGYKLE 
SLPINYLVEG MFFALGGDKD FKFNKEYKKL IPLIEDAIPC VKKIVADKVK EENMVEAFML
LKGLSEISDE TEVYENLLLI CESLRERNKA YNETQLEICD KCKANRSDLA LPYLYSAIAY
NDIGQYDKAY VDINEYLAKG GERNEIVEVL YNEIKDSADY EEGKEDLIEE PEDALKRLLP
LADKFQDNAI LRYYIATAYR RLGNFEKAVY YLNECLSIDD SIVEAVNEMG INYASLGIYD
EAIKYLRKAF ESTRDIEVCT NLIVCYLNAG KIEEAKQHLD IAKAINKDDE IVKEIETFMK
NNNIK