Gene CPF_0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0159 
Symbol 
ID4202395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp187354 
End bp188841 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content32% 
IMG OID638081040 
Productamino acid permease family protein 
Protein accessionYP_694623 
Protein GI110799203 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAGA GAAAAAGGTC GCTTAGTTCA GGGGCTTTAA TGCTTATGAC ATTTACGGCG 
GTATTTTCAT TCGGAAACAT AATTGATAGT AGTGTAAATA TTGGACTAGC TACAATACCA
TCATACATAT TTGGTACAGT ATTTTACTTT TTACCATTTG CTTTGATGAT TGGTGAGTTT
GCTTCAGCTA GTTCAGATTC TGAATCAGGT ATAAATAGTT GGATAAAGAA ATCTTTAGGA
GCAAGATGGG CTTTCTTAGG ATCATGGTCT TATTTCTTTG TTAACTTGTT CTTCTTTACT
TCATTATTAC CTAAAATATT AATTTATGCA TCTTATACCT TTGTTGGTAG AAACGTATTT
GATGGAAAAA CAGTATTAAT TTCTGTTATA TCAATAGTAT TATTCTGGGC AGTAACTATA
ATTTCTACTA AGGGTGTATC TTGGATTTCA AAAATAACAA GTATATCTGG TGTAGCTAGA
ATAATTTTAG GTTTAGGATT CATAGTATTA TCTTTCGGAG TTATTTTATT CTTAGGAAAA
GCTCCAGCTC AAGAATTCAC AGCAGAAACT ATTATGCCTA AGTTTAACTG GTCATATTTC
ATGGTTTTAG CTTGGATTTT ACAAGCAGTA GGTGGTGCTG AAAGTATAGG TGTATATATT
AAAGACGTAA AGGGTGGAAA TAAAACATTC ATAAGAACAA TGGTAATTTC AACTGCTATA
GTTGGTGGTC TATATGCACT TGGAGCTGTT TCAGTAGGTT TAGTTGTACC TTCAGAAGTA
TTACAAGGTA ATTTCTCAAA TGGTTTATTT GATGCCTTTG CAATATTAGG TGCAAACTAT
GGAGTAGGTA ACATAATAAC TAACATAGTT GGATTCATAA TGATGCTTGC TTCTTTAGGT
TCATTAGTTT TATGGACTGC AGCACCTGTT AAAGTATTAT TCTCTGAAAT TCCTGAAGGA
ATCTTTGGTA AATGGATTGC TAAAACAGAT AAAAAAGGAA CTCCTGTAAA TGCTTTATAT
GTACAAGCTG TAATAGTAAC AGTATTACTA TTAGTACCAG CTTTAGGAAT AGGTTCAGTT
GATAGCTTAC TTGAAATGTT AATAAACATG ACTGCTTCAA CTTCATTAAT TCCAGTATTA
TTCTTCTTAG TTGGATACAT TGTATTAAGA GCTAAGAAAG ACCATATGGA AAGATCATTT
AAAGTTGGAT CTAAAAACTT TGGAATAGCA ATTGGAGTAC TATTACTTGC TTTATTCGTA
TTCGTATTTG TAATATCTTC AATTCCAGCT CCACAAGACT TTGCAGCTTA CTTTAATGGA
ACATTAGCAG AAGGAGCAAC AAATCCTGTA TTTATACTTT TATACAATGT ATTAGGATTA
GTATTCTTCT TAGGTTTTGC TGAAATATGC TGGAGAAAAT ATGAAAAGAA AGTTGGAAAA
GCTGTAGCTA ATGAATGGGA TCAAGAAGAG GTTTCAGAAA TAGCTTAA
 
Protein sequence
MSERKRSLSS GALMLMTFTA VFSFGNIIDS SVNIGLATIP SYIFGTVFYF LPFALMIGEF 
ASASSDSESG INSWIKKSLG ARWAFLGSWS YFFVNLFFFT SLLPKILIYA SYTFVGRNVF
DGKTVLISVI SIVLFWAVTI ISTKGVSWIS KITSISGVAR IILGLGFIVL SFGVILFLGK
APAQEFTAET IMPKFNWSYF MVLAWILQAV GGAESIGVYI KDVKGGNKTF IRTMVISTAI
VGGLYALGAV SVGLVVPSEV LQGNFSNGLF DAFAILGANY GVGNIITNIV GFIMMLASLG
SLVLWTAAPV KVLFSEIPEG IFGKWIAKTD KKGTPVNALY VQAVIVTVLL LVPALGIGSV
DSLLEMLINM TASTSLIPVL FFLVGYIVLR AKKDHMERSF KVGSKNFGIA IGVLLLALFV
FVFVISSIPA PQDFAAYFNG TLAEGATNPV FILLYNVLGL VFFLGFAEIC WRKYEKKVGK
AVANEWDQEE VSEIA