Gene CPF_1738 details


Gene Information

Locus tag: CPF_1738
Symbol:
ID: 4202015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1959096
End bp: 1960064
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 30%
IMG OID: 638082610
Product: hypothetical protein
Protein accession: YP_696174
Protein GI: 110800348
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
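
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; neither convention is stated in the record, although both are consistent with the reported values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1959096
end_bp = 1960064
reported_gene_length = 969      # bp
reported_protein_length = 322   # aa

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed lengths agree with the Gene Length and Protein Length fields.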


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0611482
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTATTAT TTTCATTAAG TATATTAGTA GGATGTAATA CTTCTAAGAA AGAAGAAGCT 
AAGGCACCAG AAGAAAAAAC ATCCATAGAA ATAGTAGTAC CAGATGGACT TCCAGCTATT
AGTATAGTTA AAATGATAAA AGAAAAACCA GAAATAATAA AAGGCTTAGA TATAAATTAT
TCAATAGTAA AGGGATCAGA TGCTTTAGTT TCTAAGGTGT TAAAAGGAGA GGGAGATATA
TGTATAGTTC CTTCAAATGT AGCTGCTATT GCATATAACA AGGAAGCTAA ATATAAACTT
GCAGGAACAG TAGGTTTTGG TTCATTATAT GTTATAAGCA GTGATGATTC TGTTAATAGC
TTAGAAGATC TTAAAGGAAA AGATGTTTAC AATGTTGGTC AAGGATTGAC TCCAGATTTA
ATATTTAAGA TATTACTTCA AAATGATGGA ATAAATCCTG AAAAAGATTT AACATTAAGT
TATGTAAATG CAGCTTCAGA ATTAGCTCCT TTATTTATAG AGGGAAAAGC TAAATATGCA
GTTGTTCCAG AACCTATGTT AACTCAAATA ATGACAAAGA AACCAGAAAC AAAAATAGTA
GCATCATTAA ATGAACAGTG GAAAAAAATG AGTGATTCAA AAATGGGATA TCCTCAGTCT
AGTGTTATAG TTAAAGAGGA CCTAGCAAAA AATAATTCAG AGGCTGTTCA AAAGATCTTA
AAGGAAATAG ATAATAGTAC TAAGTGGGCA AATGAAAATA AAGAAGAAGC AGGTGCCTTT
GCAGAAGAAG TTGGCATAAC AGGCAAAAAA GAAATAATAG CTAAATCTCT AGAAAGAGCA
AATTTAAATT ACGTAAGTGC TTTAGATAGT GAAAGTGAAT ATATTAAATA TTATGACAAG
ATTTACAGCT TAGAGCCTAA AGCTATAGGA GGTAAAAAGA TAAATGAAGA AATTTTCTTA
CAAAAATAA
 
Protein sequence
MVLFSLSILV GCNTSKKEEA KAPEEKTSIE IVVPDGLPAI SIVKMIKEKP EIIKGLDINY 
SIVKGSDALV SKVLKGEGDI CIVPSNVAAI AYNKEAKYKL AGTVGFGSLY VISSDDSVNS
LEDLKGKDVY NVGQGLTPDL IFKILLQNDG INPEKDLTLS YVNAASELAP LFIEGKAKYA
VVPEPMLTQI MTKKPETKIV ASLNEQWKKM SDSKMGYPQS SVIVKEDLAK NNSEAVQKIL
KEIDNSTKWA NENKEEAGAF AEEVGITGKK EIIAKSLERA NLNYVSALDS ESEYIKYYDK
IYSLEPKAIG GKKINEEIFL QK
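
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch rather than the portal's own method: it assumes the sequence is written 5' to 3' on the coding strand, and it uses the standard codon assignments, which translation table 11 shares for every codon (table 11 differs only in the set of permitted start codons). The function names `clean`, `translate` and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon; trailing partial codons are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last display lines of the gene sequence, copied from the record.
first_line = "ATGGTATTAT TTTCATTAAG TATATTAGTA GGATGTAATA CTTCTAAGAA AGAAGAAGCT"
last_line = "CAAAAATAA"

print(translate(first_line))  # MVLFSLSILVGCNTSKKEEA
print(translate(last_line))   # QK*
```

The first line translates to the first twenty residues of the protein sequence, and the last line ends in a stop codon (`*`), which is not shown in the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.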