Gene CPR_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0072 
Symbol 
ID4204543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp80628 
End bp81692 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content27% 
IMG OID642564618 
Productextracellular solute-binding protein 
Protein accessionYP_697413 
Protein GI110802721 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGAA TAAAGAAAAT ATTATCATTA GGTTTGCTTT TAGTATTTTC ATTTTCAATG 
GTATCTTGTG GTGGATCAAA AGAAAATACT TTAGAGGATA CAGTAGTAGT AACAAGTAAA
TATCCTGATG AAGTTAATAG TTATATAGCT GAAGAATTTA AGAAAGAAAC TGGAATATCA
GTTAAATATG AAGTGAAAGA TGAAATAAAA GAAGATGATT TTAAAAATTC TAATACTGAT
ATTATCTTAG GTGGAAATAG CGAATTATAT AAAAAAATGG CTTCAGATAA TATTCTTAAA
GGATATAAAA CAAGTTGGTA TAGCGATGTA GATGATAATT ATAGAGATAA AGATGGATAT
TGGTATTCAA TATTTAGAAA CCCTATGGTA GTTGCTTATA ATAAAGCTAA CTTAGCAGCG
AATCTTGTAC CAAAGAGTTT AGCTGATTTA AAAAACGGGA ATTTAGCTAA TAAATTATTA
ATGGTAAATT CCAATAATGA TTATACAAAG TATTTTATAT CTGCTACAGC TTCTTATTTA
ACTAAAGAAG CAAATAATGA TGATAATATA GGAAATACTT TCTTACAAGG TGTAAAGTTA
AATGTGGCTA CATTTTTTAA TAATTACGAT GAATTATTTA CAGCTTTAGA CACTAAAGAA
ACTCCAATAG GAATTTTACC TTTAGATGTT TTAAATAAAA AAATTAAAGA TAATGCTAAT
ATAACAAGAA TTGATTTTGA AGAGGGTGTA CCCGTTATAA CTGAATGTGC AGGTATATTA
AAATCAGCTC CTAATCCAAA TGCTTCAGAA CTATTTATGG AGTTTGTAGC TGGGCCAAAG
ATTCAATTAG AACTAGCTCA GAAATTTAAT ATAATGCCTA CATTACCTGT AGCAATAAAA
TATTCTCCTG ACTGGATTAA GAATTTTAAA ACTTTAGATA TAGAAAATAA TGTTGTTCTT
GAGAATGAAG ATAAATGGGT TCAATTCTTT AATGGTGTTG TTAAACCAGA AGTACCTGCC
AAGACAACTA ATAATCCTGT TATTAAAGGT AAGAAGAAAT CTTAA
 
Protein sequence
MKGIKKILSL GLLLVFSFSM VSCGGSKENT LEDTVVVTSK YPDEVNSYIA EEFKKETGIS 
VKYEVKDEIK EDDFKNSNTD IILGGNSELY KKMASDNILK GYKTSWYSDV DDNYRDKDGY
WYSIFRNPMV VAYNKANLAA NLVPKSLADL KNGNLANKLL MVNSNNDYTK YFISATASYL
TKEANNDDNI GNTFLQGVKL NVATFFNNYD ELFTALDTKE TPIGILPLDV LNKKIKDNAN
ITRIDFEEGV PVITECAGIL KSAPNPNASE LFMEFVAGPK IQLELAQKFN IMPTLPVAIK
YSPDWIKNFK TLDIENNVVL ENEDKWVQFF NGVVKPEVPA KTTNNPVIKG KKKS