Gene CPF_1672 details


Gene Information

Locus tag: CPF_1672
Symbol:
ID: 4202100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1890817
End bp: 1892205
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 33%
IMG OID: 638082546
Product: sodium:alanine symporter family protein
Protein accession: YP_696110
Protein GI: 110800336
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1115] Na+/alanine symporter
TIGRFAM ID: [TIGR00835] amino acid carrier protein
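
The start/end coordinates and the two lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1890817, 1892205)
print(length_bp)                  # 1389
print(protein_length(length_bp))  # 462
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.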


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.251428
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAACT TATTAAATCA AATCGACAAT TTAGTATGGG GAGTGCCACT TCTTATGCTG 
CTTGTAGGAA CAGGAATATA CCTGACTATA AGACTAAAAT TATTACAAAT TTTAAAATTA
CCATTAGCCT TAAAGTATGT ATTTAAAAAA GATGAGGAAT CTACTTGTGA GGATGCAGAA
GGAGATGTTT CAAGTTTTGG AGCCTTATGT ACAGCCCTTT CAGCTACTAT AGGTACTGGA
AACATAGTTG GTGTTGCCAC TGCTATAAAG GCAGGGGGAC CAGGAGCATT ATTTTGGATG
TGGGTAGCTG CTTTCTTTGG AATGGCAACT AAATATGCAG AAGGTGTACT TGCTATAAAG
TATAGGGTTG TAGATGAAAA TGGTCAAATG GCTGGTGGAC CAATGTACTA TATAAAAAAC
GGACTTGGTT TAAATTGGCT AGCTAATATT TTTGCTTTCT TTGGAATTGG TGTAGCTTTA
TTAGGAATAG GAACTTTTGG ACAAGTTAAG TCCATAACTG ATGCAGCAAG TATTACATTT
AATGTTCCTA CAATTATTAC AGCAGGAGTA GTAACTTTAT TAGTGGCTTT AGTAATTTTA
GGTGGAATAA AAAGAATATC TAGCGTATCA GAAAAGGTAG TTCCTTTAAT GGCGGGACTT
TATATATTAG GAGTTTTAAT TGTTATAGCT TTTAATTTAG ATAAGGTTCC ACATGCAGTA
TCAATAATTA TTGAAAGTGC CTTTAATACT AAGGCTGCTT TAGGCGGAGC TGTAGGAGTT
AGCATAATAA CTGTAATGAA AAGTGGAATA GCTAGAGGGG TTTTCTCTAA TGAAGCTGGG
CTTGGAAGTG CTCCAATAGC AGCGGCGGCA GCTAAAACTA AGTCTCCAGT TAAGCAAGGA
CTTATTTCAA TGACAGGTAC ATTCTTTGAT ACAATTCTTA TTTGTACAAT GACAGGTATA
GTAATAATTC TTACTGGTGC TTATAGTGGA AGTTTAGAAG GAGCAGCACT TACAACACAG
GCTTTTGAAA TAGGTCTTCC TATAAGTAAT ATAGGAACAT ATATAGTTAA TATAGGACTT
ATGTTCTTTG CATTTACTAC AATATTAGGA TGGAACTATT ATGGAGAAAG ATGCATTGAG
TATTTATTTG GAATAAAAGC TATAAAACCA TATAGAATTT TATATATAAT TTTAGTTGCT
ATAGGATCAT TCTTACCATT AACATTAATA TTTATAATTG CAGATATTGT TAATGGATTA
ATGGCAATTC CAAACCTTGT AGGTATTATT GGATTAAGAA AAGTAGTAAT AGAAGAAACA
GAGGAATTCT TTAGGGAAAA AGCTTTAAGT GAAGAGAGTG CAGAATTAGA AGGAACTGTT
TTAAATTAA
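
The gene sequence is printed in space-separated groups of ten bases, so the whitespace has to be removed before the sequence can be measured or analysed. A minimal sketch of that cleanup plus a GC-fraction calculation follows. The helper names are hypothetical, and only the first line of the sequence is used as a short sample.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence printed in grouped blocks."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above, used only as a short sample.
sample = "ATGGAAAACT TATTAAATCA AATCGACAAT TTAGTATGGG GAGTGCCACT TCTTATGCTG"
seq = clean_sequence(sample)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # 0.35
```

Passing the full sequence block through the same two functions gives values that can be compared with the Gene Length and GC content fields.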
 
Protein sequence
MENLLNQIDN LVWGVPLLML LVGTGIYLTI RLKLLQILKL PLALKYVFKK DEESTCEDAE 
GDVSSFGALC TALSATIGTG NIVGVATAIK AGGPGALFWM WVAAFFGMAT KYAEGVLAIK
YRVVDENGQM AGGPMYYIKN GLGLNWLANI FAFFGIGVAL LGIGTFGQVK SITDAASITF
NVPTIITAGV VTLLVALVIL GGIKRISSVS EKVVPLMAGL YILGVLIVIA FNLDKVPHAV
SIIIESAFNT KAALGGAVGV SIITVMKSGI ARGVFSNEAG LGSAPIAAAA AKTKSPVKQG
LISMTGTFFD TILICTMTGI VIILTGAYSG SLEGAALTTQ AFEIGLPISN IGTYIVNIGL
MFFAFTTILG WNYYGERCIE YLFGIKAIKP YRILYIILVA IGSFLPLTLI FIIADIVNGL
MAIPNLVGII GLRKVVIEET EEFFREKALS EESAELEGTV LN
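
The protein sequence can be checked against the gene sequence by translating the latter with the listed translation table. The sketch below builds the codon table by hand rather than using a library. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code, and it ignores the alternative start codons that table 11 permits. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 assigns the same
# amino acids as the standard code; the start-codon differences are ignored here.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAAAACT TATTAAATCA AATCGACAAT"))  # MENLLNQIDN
print(translate("TTAAATTAA"))                         # LN
```

The two sample calls use the first 30 bases and the last 9 bases of the gene sequence, and their output matches the first ten and last two residues of the protein sequence.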