Gene CPR_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2046 
Symbol 
ID4206454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2264082 
End bp2265572 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content32% 
IMG OID642566596 
Productamino acid permease family protein 
Protein accessionYP_699355 
Protein GI110801782 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAATA GTAATGATGG AAAAAAACTT ATGTGGTATA ACCTTGGTTT AATGGCCTTT 
GTATCAGTTT GGGGCTTTGG TAACGTAGTA AATAACTTCG CAACTCAAGG TTTAACCGTA
ATAACTTCAT GGATATTAAT AATAGCTTTA TACTTCGTAC CATATGCGCT TATGGTTGGA
GAATTAGGCT CAACTTTTAG AGATAGTAAA GGTGGCGTAA GTTCGTGGAT AAGTAAAACA
ATGGGGCCTA CATTAGCTTA CTTAGCAGGT TGGACTTATT GGGTGGTACA TGTGCCTTAT
TTAGCGCAAA AACCACAAGC AGTACTTGTT TCATTAGGAT GGGCAGTATT CCAAGATGGA
AGCACTATAA AAGGTATAGA TTCTAAAATT ATTCAATTAG TATGTTTAGT AGTATTCTTA
TTCTTTGTAT GGATTGCCTC AAGAGGAGTA AATTCATTAG GGAAAATAGG TACAATAGCA
GGAACAGCAA TGTTTGTTAT GTCTATTCTT TATATAGTAC TTATGTTAAC AGCACCTGCT
ATTACAGGAA CTTCTATAGC AAGTCCAAAT ATGACTTCTA TAAAAACCTA TATACCAAAA
TTTGATTTTG CCTATTTTAC CACTATAGCT ATGTTAGTAT TCTCAGTTGG AGGAGCTGAA
AAGATATCTC CATATGTTAA CAACATGAAA GATTCTAAAA AAGGTTTCTC AAAAGGTATG
ATAGCTTTAG CTATAATGGT TGCAGTTACA GCACTTCTTG GATCAGTAGC AATGGGAATG
ATGTTTGATG CAAATAATGT TCCTGATGAC TTAATGTTAA ATGGTGCTTA CTATGCATTC
CAAAAGTTAG GTAATTATTA TGGAATAGGA AATTCTTTAT TAATATTATA TGCATTAGCA
AACTTTGCAG CTCAAGTTTC AGCATTAGTA TTCTCAATAG ATGCTCCATT AAAAGTTTTA
TTATCAGATA CTGATGCAAG ATATGTTCCA ATAGCATTAA CTAAGACTAA TAAGAATGGG
GCTCCAATAA ATGGATATAT AATGACTTCT ATTTTAGTTG GAATATTAAT AATAGTTCCA
GCTTTAGGAA TTGGAAACTT TAATGCTTTA TTTACTTGGT TATTAAAATT AAATGCTGTT
GTTATGCCAA TGAGATATTT ATGGGTATTC TTAGCTTACA TAATGTTAAG AAAAGCTATA
AAAGGAAAGT TCAAATCAGA GTACAAATTT GTTAAAAATG ATAAATTCGC AATGTTAATA
GGTACTTGGT GTTTTGTATT TACAGCATTT GCTTGTATAT TAGGTATGTT CCCAACAGAT
GTTAAAGCAT TCTCAGGAGA ATGGATTTTC AGAGTAGGAA TGAATATTGG TACACCTTTA
GTATTAATAG GATTAGGTTT AATATTACCT AAAATAGCTA AGAGAACTAA CGGACAAGCA
TACAAGGATG CAGTAAGAGA AGCTACAGCA ACAAAGTTAG AACTTAATTA G
 
Protein sequence
MGNSNDGKKL MWYNLGLMAF VSVWGFGNVV NNFATQGLTV ITSWILIIAL YFVPYALMVG 
ELGSTFRDSK GGVSSWISKT MGPTLAYLAG WTYWVVHVPY LAQKPQAVLV SLGWAVFQDG
STIKGIDSKI IQLVCLVVFL FFVWIASRGV NSLGKIGTIA GTAMFVMSIL YIVLMLTAPA
ITGTSIASPN MTSIKTYIPK FDFAYFTTIA MLVFSVGGAE KISPYVNNMK DSKKGFSKGM
IALAIMVAVT ALLGSVAMGM MFDANNVPDD LMLNGAYYAF QKLGNYYGIG NSLLILYALA
NFAAQVSALV FSIDAPLKVL LSDTDARYVP IALTKTNKNG APINGYIMTS ILVGILIIVP
ALGIGNFNAL FTWLLKLNAV VMPMRYLWVF LAYIMLRKAI KGKFKSEYKF VKNDKFAMLI
GTWCFVFTAF ACILGMFPTD VKAFSGEWIF RVGMNIGTPL VLIGLGLILP KIAKRTNGQA
YKDAVREATA TKLELN