Gene CPR_1012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1012 
Symbol 
ID4203965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1153130 
End bp1154599 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content31% 
IMG OID642565569 
Productamino acid permease family protein 
Protein accessionYP_698335 
Protein GI110801741 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000367074 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAGG ATAAAAAAAT AAAGTTATGG GCATTGGTAA TGCTTATTTT CGTACCAACA 
TTTGGTTTTG GTAATATAGC AAGTAATGCT GTGTATTTAG GGCCAGCAGC TATTCCATCG
TGGGTAATAG TATCTATTCT TTATTTCTTA CCTTTATGTG GAATAATAGC AGAAATGGCT
TCAGCAAATA GAGATAAAGA AGGTGGGATA TATTCATGGA TAAATAAAGC TTTAGGTGAA
AAATGGGCTT TTGTGGGAAC TTGGACTTAT TTCATAGGGA TATTATTTTA TTTACAAATG
GTTTTTTCAA GAATACCAGT AGCTGCATCA TGGGCAATAC TTGGAAGAAA TATATTTACT
GATTCTAATG CATATTTATT ACCTATTTTA TCAATTGTAA TATGTATAGC AATGACATAT
GTAGCCACTA TAGGAGTAAG TAAGTTTTCA AAACTTGCTG ACTTTGGAGG ACAATTTACT
TTAGCAGCAA CTGTTATATT TATATTAATG GCAATAGCAG GATATTTCAT TGGCAAACCT
TCTGCAACAG AATTTACAGT TCAAAATATA ATTCCAGATT TTAGTGTGAG TTATTTTTCT
ACATTCTCAT GGCTACTATT TGCAGTGTCA GGATCAGAAG TTGCAGGTAC TTATATAATG
CAAACTGAAA ATCCTAAAAA GACATTTCCA AAGGCTATGA TAATTGCAAC AATTTTAATA
GCACTCTCAT ACATTTTAGG TTCTGTAGCT ATACAGTTTA TAGCATCACC AGAAGTTTTG
CAAAAGGCAG GAATTCAAGA TGCCGGTTAC GTTGTATATT ATATATTAGC TAATAATTTC
GGAATAAATG GTAAGGTTAT AGTACAAGTA TATGCAGCTA TTAATTTAAT AACATCAATA
GCAGCATATA TAATTTGGAT GGAATCACCA ATAAGAGCCA TGTTTGGAGA AGTTCCAAAG
GGAACATTTC CAAGTTTCTT AACTAAGAAG AGAAAAGATG GAACTTTAGT AAATGCTTTA
TGGACACAAT GTGCAATTTT AGTAGTATTA ATATCAATAC CTTTATTAGG AATAGGATCA
ATTAATGATT TCTTTAAATT ATTAACAGAT TTATCATCTC TAGCAGTTGT TGTTCCATAT
GTAGTCTTAA TATGTGCATT TATATCTTTT AGAAAACACA ATAAAGATTT AGACTTTAAA
TTTTTTAAAA GTGATGCTTT AGCATACACA GTTGCAGGAA TAGCTTTAGT ACTTTCTTGT
GCTGGATTCT TTGGTGCTGG CCTTCAAGAC ATTGTTAGAA GTTCAGGGAA AGACGCAACA
ATTTTAATAA TAAAAACTTA TGGGGGACCT GTTATTCTTA TGGCTTTAGG TCTTGTTTTA
AGAGCTTTAT CTGAAAAGTC ATATAAAAAT AAGAGTTTAG GCAATGAGAA TATTGAAGTT
GAAATTGTTG AAGAGAGTAT ATGTGAATAG
 
Protein sequence
MGKDKKIKLW ALVMLIFVPT FGFGNIASNA VYLGPAAIPS WVIVSILYFL PLCGIIAEMA 
SANRDKEGGI YSWINKALGE KWAFVGTWTY FIGILFYLQM VFSRIPVAAS WAILGRNIFT
DSNAYLLPIL SIVICIAMTY VATIGVSKFS KLADFGGQFT LAATVIFILM AIAGYFIGKP
SATEFTVQNI IPDFSVSYFS TFSWLLFAVS GSEVAGTYIM QTENPKKTFP KAMIIATILI
ALSYILGSVA IQFIASPEVL QKAGIQDAGY VVYYILANNF GINGKVIVQV YAAINLITSI
AAYIIWMESP IRAMFGEVPK GTFPSFLTKK RKDGTLVNAL WTQCAILVVL ISIPLLGIGS
INDFFKLLTD LSSLAVVVPY VVLICAFISF RKHNKDLDFK FFKSDALAYT VAGIALVLSC
AGFFGAGLQD IVRSSGKDAT ILIIKTYGGP VILMALGLVL RALSEKSYKN KSLGNENIEV
EIVEESICE