Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0756 |
Symbol | |
ID | 4203491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 897410 |
End bp | 898675 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638081640 |
Product | proton/sodium-glutamate symporter |
Protein accession | YP_695207 |
Protein GI | 110799556 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0642378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TAGGACTAGC ATTTCAAATT GTACTTGGAC TTATACTAGG TATTATAGTA GGAGCTGTTT TTTATGGTAA TCCAGTTGTT ACTTCATATT TACAACCATT TGGAGATATT TTTATAAGAT TAATTAAAAT GATTGTCATA CCTATTGTCT TTTCATCACT TGTTGTTGGT GTTGCTGGAG TTGGAGATGT TAAAAAATTA GGGAAAATCG GCGGAAAAAC AATACTTTAC TTTGAGATTG TAACTACATT CGCTATTATA ATAGGTTTAG TTGTAGCTAA TTTATTTCAT CCAGGAAGCG GAGTAAACAT TAGTACTCTT GCAACTACTA ATATTGATAA ATATATGAGT ACAGCAGAAG CTGCATCTAA CCATGGATTT ATGGATACAT TTATAAATAT TGTTCCAACT AATATTTTTG AATCTCTTGC AAAAGGAGAT TTACTTCCTA TTATTTTCTT TTCAGTTATG TTTGGATTAG GTGTAGCTGC AATTGGAGAA AAAGGAAAAC CTGTCCTTGC AATATGTCAA GGTATAGCTG ATTCAATGTT TTGGATTACT AATCAAATTA TGAAGCTTGC GCCACTTGGC GTATTTGGAT TAATAGGTGT AACTGTTTCT AAATTTGGAT TAGCTTCATT AATTCCTTTA GGAAAGTTAA TAATTACTGT TTATGGCGCC ATGTTCTTCT TTGTATTTTT TGTTCTTGGC TTTATTGCAA AAATATCTGG AACAAGCATT ATATCACTTA TAAAGCTTTT AAAAGATGAA CTTATTTTAG CTTATACTAC AGCAAGTTCT GAAGCCGTTT TACCAAAGCT TATGGAAAAG ATGGAGAGGT TTGGCTGTCC TAAGGCAATT ACATCTTTTG TTATTCCAAC AGGATATTCA TTTAACTTAG ATGGATCTAC TTTATATCAA TCTATTGCAG CACTTTTTAT TGCTCAAATA TATGGAATTC ACTTACCACT TTCTGCTCAA ATTAATTTAG TGCTTGTATT AATGCTTACT TCAAAAGGTA TGGCTGGAGT TCCTGGTGCA TCTTTTGTAG TACTTTTAGC AACTGTTGGT TCTTTAGGAA TTCCAGTAGC AGGAGTTGCC TTTATTGCTG GTATAGATCG TATCGTTGAT ATGGCGAGAA CTCTTGTTAA TGTACTTGGA AACTCATTAG CTGTTGTTGT TATATCTAAA TGGGAAAAGG AATTTAATGC TGAAGAAGGA CAAAAATATA TTAAATCAGT TAGTGAAATA GCATAA
|
Protein sequence | MKKLGLAFQI VLGLILGIIV GAVFYGNPVV TSYLQPFGDI FIRLIKMIVI PIVFSSLVVG VAGVGDVKKL GKIGGKTILY FEIVTTFAII IGLVVANLFH PGSGVNISTL ATTNIDKYMS TAEAASNHGF MDTFINIVPT NIFESLAKGD LLPIIFFSVM FGLGVAAIGE KGKPVLAICQ GIADSMFWIT NQIMKLAPLG VFGLIGVTVS KFGLASLIPL GKLIITVYGA MFFFVFFVLG FIAKISGTSI ISLIKLLKDE LILAYTTASS EAVLPKLMEK MERFGCPKAI TSFVIPTGYS FNLDGSTLYQ SIAALFIAQI YGIHLPLSAQ INLVLVLMLT SKGMAGVPGA SFVVLLATVG SLGIPVAGVA FIAGIDRIVD MARTLVNVLG NSLAVVVISK WEKEFNAEEG QKYIKSVSEI A
|
| |