Gene CPF_1477 details

Gene Information

Locus tag: CPF_1477
Symbol:
ID: 4202021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1675029
End bp: 1676075
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 30%
IMG OID: 638082355
Product: ABC transporter, substrate-binding protein
Protein accession: YP_695920
Protein GI: 110800354
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
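
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 1675029
end_bp = 1676075

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1047 bp
print(protein_length, "aa")  # 348 aa
```

Both values agree with the Gene Length and Protein Length fields of the record.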


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0040976
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA AAATTTTAGC AACCTTATTA ACAGGATTAG TACTAGGGAC ATCTTTAGTT 
GGATGTGGAA AAACAGAAGG AGCAGAGGCA GGAAAAAAAT TAGTAGTTTC AACTTGGGGA
TTAAATGAAG ATGTATTAAA GGAGACAGTA TTTGAACCAT TTGCTAAAGA ACATGGTGTG
GAAATAGTTT TAGATATAGG TAATAACTCA GAAAGATTGA CTAAGATGAA AAATAATCCT
AACTCACAAA TAGATATAAC TTATTTAGCA GAATCTTTTG CAGAGCAAGG AGTAGAAGCA
GGTATATTCG ATAAATTAGA TTATTCAAAG ATTCCTAATG CAAGTGAGAT GAATGAAAAG
GCTAAATCTA CAGTAGAGGC AGGATATGGA CCAGCATATA CATTAAATAG TATAGGAATA
GTTGTAGATC CTTCAGCAGG AATTGAAATA AATTCTTGGG AAGATTTATG GAAACCAGAA
CTTAAAAATA AAATAGCTAT ACCAGATATT ACAACAACTA ATGGTCCAGC AATGGTAGAA
ATAGCAGCAG AAAAGGCAGG AGTAGATGTT AAAACTGATA ATGGAGAAGC TGCATTTAAA
GAATTAGAAG CTTTAAAACC AAATGTTGTT AAAACTTATA GTAAATCTTC AGATTTAGCT
AATATGTTCT CAAATGGTGA AATAGTAGCT GCTGTAGCAT CAGACTTTGC TTTTGGAACA
ATTTCAAAAG CTAAACCAGA GGTTATAAAT GTAATACCTG AGTCAGGAAC TTACTTAAAC
TTTAATACAA TAAATATAAA TAAAAATTCT AAAAATAAAG ATTTAGCTTA TGAATTTATT
AACTATGCAT TAAGCAAAGA AGTTCAGGAG AAAACTGCTA AAGCTTTAAA TGAATCACCA
GTTAATAAAG AAGTTAAATT AAGTGAAGAA GAAACTAAAA ACTTAACATA TGGACCAGTG
GTTGATAATG CTAAGGTAAT TGACTTTAAG TTTGTTAATT CAGTAATGGA TCAATGGGTT
AATAACTGGA ACAGAATAAT GAACTAG
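
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence laid out in blocks of ten bases like the one above. This is a minimal sketch: the function name `gc_fraction` is hypothetical, and the example uses only the first ten-base block of the gene sequence, not the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)

# First ten-base block of the gene sequence above (2 G, 0 C out of 10).
first_block = "ATGAAAAAGA"
print(gc_fraction(first_block))  # 0.2
```

Pasting the full gene sequence, spaces and line breaks included, into `gc_fraction` gives the whole-gene value to compare with the reported percentage.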
 
Protein sequence
MKKKILATLL TGLVLGTSLV GCGKTEGAEA GKKLVVSTWG LNEDVLKETV FEPFAKEHGV 
EIVLDIGNNS ERLTKMKNNP NSQIDITYLA ESFAEQGVEA GIFDKLDYSK IPNASEMNEK
AKSTVEAGYG PAYTLNSIGI VVDPSAGIEI NSWEDLWKPE LKNKIAIPDI TTTNGPAMVE
IAAEKAGVDV KTDNGEAAFK ELEALKPNVV KTYSKSSDLA NMFSNGEIVA AVASDFAFGT
ISKAKPEVIN VIPESGTYLN FNTININKNS KNKDLAYEFI NYALSKEVQE KTAKALNESP
VNKEVKLSEE ETKNLTYGPV VDNAKVIDFK FVNSVMDQWV NNWNRIMN
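
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline. It relies on the fact that translation table 11 uses the standard codon assignments for elongation; its special handling of alternative start codons is not implemented here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGAAAAAGA AAATTTTAGC AACCTTATTA"))  # MKKKILATLL
print(translate("AATAACTGGA ACAGAATAAT GAACTAG"))     # NNWNRIMN
```

The two outputs match the first ten and last eight residues of the protein sequence, and the final TAG codon is the stop codon that accounts for the protein being one residue shorter than the codon count.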