Gene CPR_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1034 
Symbol 
ID4205766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1175286 
End bp1176908 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content29% 
IMG OID642565591 
Productamino acid permease family protein 
Protein accessionYP_698357 
Protein GI110802934 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.696759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCAA AAGAAAAAAG CAAATTAACC CTAACAGCAT TAATTCTTAT GATATTTACA 
TCTGTTTATG GCTTTACTAA TATGCCAAGA ACCTTTTATT TAATGGGTTA TAGTGGAATA
ATATGGTATA TAATGTCAGC TATTTTATTT TTTATTCCTT ATGCTTTTAT GATGGCTGAA
TATGGTGCTG CCTTTAGAAA AGAAAAAGGT GGTATATATT CTTGGATGGC TAAATCAGTA
AATCCTAAAT TTGCTTTTAT AGTAACATTT ATGTGGTTTT CTTCAAATCT AATATGGATG
GTATCTGTAT CATCTTCTAT ATGGATTCCT TTATCTAATG TATTTTTTGG TAAAGATACT
ACTTCTACTT GGAATATATT CGGTTTAACA GGTCCTCGTG CAATGTCTGT ATTAGCTATA
ATATTATTAA TAATAATAAC ATTTATTTCT TCTAAAGGAT TAAATAAAAT TTCAAAAATT
ACATCCATAG GAGGAACAGT TGTTGCCTTG TCTAATGTAG TTTTAATTCT TGGTGGTTTT
TTCGTACTAG CAAGCAATGG TTTTAAAACT GCTCAAGGCT TTAACATGGA ACAGTTTATA
CATTCTCCAA ATCCCGCATA TCAATCCCCT TTAGGCGTAC TTGGCTTCGT TGTATTTGCA
ATATTTGCTT ATGGTGGGAT AGAAGTAGTT GGTGGATTAG TTGACCAAAC TGAAAACCCT
AAAAAGAATT TTCCTAAAGG AATAAAAATT TCTGCCTTTG TAATTGTTAT TGGTTATGCA
ATAGGAATTT TATGTATTGG TTTCTTTGTT AATTGGAATA GGGATTTAAC TGGTTCTGAT
GTAAATATGG CAAATATAGC ATATATTATA GTAAATTATC TTGGTTATTA CATAGGAGAA
GCAATTGGCT TAAGTGCTAA TGCTTGTACA ATATTAGGTA ACTTCTTTGC TAGATTTATA
GGACTTTCAA TGTTTTTAGC ACTTATAGGT GCCTTCTTCG CTTTAAGTTA CTCTCCTTTA
AAACAATTAA TAGAAGGTAC TCCTAAGGAT ATTTGGCCTG AAAAATGGAC AAGATTAAAC
AAAGAAAATA TGCCAGCTAC AGCTATGTGG ATTCAATGTA TAACAGTAAT TATATTTGTT
TTATGTTGCT CAATGGGTGG CGAAGATTCA TCTAAATTCT TTAACTATTT AATACTTATG
GGGAATGTAG CCATGACATT ACCTTATATG TTTTTATCCT TTGCTTTTTC ATTCTTCAAG
AAGAAAAAAG AAATTGTAAA ACCTTTTGAA GTATATAAAA ATTATAAGAG CGCTTTAGTT
TGGTCTATCA TAGTAACATT AACAGTTGGT TTTGCTAATA TATTTACAAT AATACAACCT
CTTACTTCAG AAAATAAAGA TTATGTTGCT GTAATCTTCC AAATTTCTGG TCCTATTATA
TTTGGATTCT TAGCATGGTT AATATATAAA AGATATGAAA ACAATGTACT AAAAAAACAA
ATTTCAAATA TTGATAATAA AATTGATACA GAACTTGAAA AAATAGAAAA AGCTGAAGAA
GAAGTTACAA GTGAAACAAT AGGAGTAATA GATATAGAAG TTCATGATGA TGATTCTAAA
TAA
 
Protein sequence
MESKEKSKLT LTALILMIFT SVYGFTNMPR TFYLMGYSGI IWYIMSAILF FIPYAFMMAE 
YGAAFRKEKG GIYSWMAKSV NPKFAFIVTF MWFSSNLIWM VSVSSSIWIP LSNVFFGKDT
TSTWNIFGLT GPRAMSVLAI ILLIIITFIS SKGLNKISKI TSIGGTVVAL SNVVLILGGF
FVLASNGFKT AQGFNMEQFI HSPNPAYQSP LGVLGFVVFA IFAYGGIEVV GGLVDQTENP
KKNFPKGIKI SAFVIVIGYA IGILCIGFFV NWNRDLTGSD VNMANIAYII VNYLGYYIGE
AIGLSANACT ILGNFFARFI GLSMFLALIG AFFALSYSPL KQLIEGTPKD IWPEKWTRLN
KENMPATAMW IQCITVIIFV LCCSMGGEDS SKFFNYLILM GNVAMTLPYM FLSFAFSFFK
KKKEIVKPFE VYKNYKSALV WSIIVTLTVG FANIFTIIQP LTSENKDYVA VIFQISGPII
FGFLAWLIYK RYENNVLKKQ ISNIDNKIDT ELEKIEKAEE EVTSETIGVI DIEVHDDDSK