Gene CPF_0853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0853 
Symbol 
ID4202271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1010960 
End bp1012219 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content37% 
IMG OID638081736 
Productputative permease protein 
Protein accessionYP_695303 
Protein GI110799021 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG2610] H+/gluconate symporter and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATGTAA CAGTTACAGC TTTAGGTGCA GTAATTGCAT TAGTTATTGC AATCACTCTT 
ATCATAAAAA AGGTTCATCC AGCATACGGA CTTATATTAG GAGCTTTAAT AGGTGGACTT
GTTGGTGGAG CAGGACTAAC AGGGACAGTT AATTTAATGA TGGATGGAGC CAAAAATATG
ATGCCTGCTA TCCTTAGAAT ACTTACAGCA GGGGTACTAG CTGGGGTTTT AATTGAATCC
GGAGCGGCAG CCAAGATTGC AGAGACAATA GTTGAGAAGA TTGGAGAATC AAGAGCTCTT
ATAGCCTTAG CAATAGCTAC CATGGTTTTA ACAATGGTTG GAGTTTTTGT AGACGTTTCA
GTAATAACAG TATCTCCAAT AGCTTTAGCA ATAGCTCACA AATCAGGTTT AAGTAAACCA
GCTATATTAT TAGCCATGGT TGGTGGAGGT AAGGCAGGAA ATATAATGTC TCCAAACCCA
AATACCATTG CGGCAGCAGA TAATTTAGGA GTTTCATTAA CTAACGTTAT GATAGCAGGT
ATAGTACCAG CTATATTTGG AGTTATAATT ACATGTATAT TAGCAAGTAG AATAAAACAA
AAAGGTTCTT TAGTAGAATC TGATGAACAT ATGAAAGAGT TAACAGATAT GCCTGGATTT
TTACCAGCGA TGGTTGGACC ATTAGTAGCA ATAATCTTAT TAATGTTAAG ACCAATAGCA
GGCATTGCTA TAGATCCATT AATAGCTTTA CCAGTAGGTG GTATATGCGG AATACTAGCT
ATGGGAAAAA TTAAGCATAT TAACAAATAT GCAACATATG GACTAGCTAA GATGAGTGGA
GTAGCTATAC TTTTAATAGG TACAGGTACT TTAGCGGGAA TAATTGCTAA CTCAGGATTA
AAGGATGTTA TTATAAGTGG AATAAACGCT TTAGGATTAC CAAGCTTTAT GTTAGCTCCA
GTTTCAGGAA TACTTATGTC AGCGGCAACT GCTTCTACTA CTTCAGGAAC AGCGGTGGCT
ACATCAGTAT TTGGACCAAC AATTGTTCAA TTAGGAGTGG CACCACTTGC TACAGCAGCT
ATGATACATG CTGGGGCTAC AGTTTTAGAT CACTTACCTC ATGGAAGCTT CTTCCACGCT
ACTGGTGGAA GTGTTTCAAT GGACATGAAG GAAAGACTTA AACTTATACC ATATGAATCT
TTAGTTGGAT TAACAATGAC AATTGTATCA ACTATAATAT TTGGAATAAT ATTAAAATAG
 
Protein sequence
MNVTVTALGA VIALVIAITL IIKKVHPAYG LILGALIGGL VGGAGLTGTV NLMMDGAKNM 
MPAILRILTA GVLAGVLIES GAAAKIAETI VEKIGESRAL IALAIATMVL TMVGVFVDVS
VITVSPIALA IAHKSGLSKP AILLAMVGGG KAGNIMSPNP NTIAAADNLG VSLTNVMIAG
IVPAIFGVII TCILASRIKQ KGSLVESDEH MKELTDMPGF LPAMVGPLVA IILLMLRPIA
GIAIDPLIAL PVGGICGILA MGKIKHINKY ATYGLAKMSG VAILLIGTGT LAGIIANSGL
KDVIISGINA LGLPSFMLAP VSGILMSAAT ASTTSGTAVA TSVFGPTIVQ LGVAPLATAA
MIHAGATVLD HLPHGSFFHA TGGSVSMDMK ERLKLIPYES LVGLTMTIVS TIIFGIILK