Gene CPR_1466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1466 
Symbol 
ID4204492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1645389 
End bp1646357 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content30% 
IMG OID642566020 
Producthypothetical protein 
Protein accessionYP_698785 
Protein GI110803172 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0764811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATTAT TTTCATTAAG TATATTAGTA GGATGTAATA CTTCTAAGAA AGAAGAAGCT 
AAGGCGCCTG AAGAAAAAAC ATCCATAGAA ATAGTAGTAC CAGATGGACT TCCAGCTATT
AGTATAGTTA AAATGATAAA AGAAAAACCA GAAATAATGA AAAACTTAGA TATAAATTAT
TCAATAGTAA AGGGATCAGA TGCTTTAGTT TCTAAAGTGT TAAAAGGAGA GGGAGATATA
TGTATAGTTC CTTCAAATGT AGCTGCTATT GCATATAACA AAGAAGCTAA ATACAAACTT
GCAGGAACAG TAGGTTTTGG ATCATTATAT GTTATAAGCA GTGATAATTC TGTTAATAGC
TTAGAAGACC TTAAAGGAAA AGATGTTTAC AATGTTGGTC AAGGATTGAC TCCAGATTTA
ATATTTAAAA TATTACTTCA AAATGATGGA ATAAATCCTG AAAAAGATTT AACACTAAGT
TATGTAAATG CAGCTTCAGA ATTAGCTCCT TTATTTATAG AAGGAAAGGC TAAATATGCA
GTCGTTCCAG AACCTATGTT AACTCAAATA ATGACAAAGA AACCAGAAAC AAAAATAGTA
GCATCATTAA ATGAACAGTG GAAAAAAATG AGTGATTCAA AAATAGGATA TCCTCAGTCT
AGTATTATAG TTAAAGAGGA CTTAGCAAAA AATAATTCAG AGGCTGTTCA AAAGATCCTA
AAGGAAATAG ATAATAGTAC TAAGTGGGCA AATGAAAATA AAGAAGAAGC AGGTGCCTTT
GCAGAAGAAG TAGGCATAAC AGGCAAAAAA GAAATAATAG CTAAATCTCT AGAAAGAGCA
AACTTAAATT ATGTAAGTGC TTTAGATAGT GAAAGTGAAT ATATTAATTT TTATGACAAG
ATTTACAGCT TAGAGCCTAA AGCTATAGGA GGTAAAAAGG TAAATGAAGA AATTTTCTTA
CAAAAATAA
 
Protein sequence
MVLFSLSILV GCNTSKKEEA KAPEEKTSIE IVVPDGLPAI SIVKMIKEKP EIMKNLDINY 
SIVKGSDALV SKVLKGEGDI CIVPSNVAAI AYNKEAKYKL AGTVGFGSLY VISSDNSVNS
LEDLKGKDVY NVGQGLTPDL IFKILLQNDG INPEKDLTLS YVNAASELAP LFIEGKAKYA
VVPEPMLTQI MTKKPETKIV ASLNEQWKKM SDSKIGYPQS SIIVKEDLAK NNSEAVQKIL
KEIDNSTKWA NENKEEAGAF AEEVGITGKK EIIAKSLERA NLNYVSALDS ESEYINFYDK
IYSLEPKAIG GKKVNEEIFL QK