Gene CPR_0742 details


Gene Information

Locus tag: CPR_0742
Symbol:
ID: 4205623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 867786
End bp: 869051
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 32%
IMG OID: 642565302
Product: proton/sodium-glutamate symport protein
Protein accession: YP_698068
Protein GI: 110803401
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
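
The coordinates and lengths listed above are mutually consistent, which a short check can illustrate. This is a minimal sketch, not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the coding sequence is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the gene record.
start_bp = 867786
end_bp = 869051
reported_gene_length = 1266      # bp
reported_protein_length = 421    # aa

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon, which does not contribute an amino acid.
codon_count, leftover_bases = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, codon_count, leftover_bases, protein_length)
```

Under these assumptions the computed gene length and protein length match the reported 1266 bp and 421 aa.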


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 6.16985e-09
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TAGGACTAGC ATTTCAAATT ATACTTGGCC TTATACTAGG TATTATAATC 
GGTGCTATTT TTTATGGAAA TCCAGTTGTT ACTTCATATT TACAACCATT TGGAGATATT
TTTATAAGAT TAATTAAAAT GATTGTAATT CCTATCGTCT TTTCATCACT AGTTGTTGGT
GTTGCTGGCG TTGGAGATGT TAAAAAATTA GGAAAAATAG GTGGAAAAAC TATTCTTTAT
TTTGAGATTG TAACTACATT CGCTATTATA ATAGGTTTAG TTATAGCTAA TTTATTTCAT
CCTGGAAGCG GAGTAAATAT TAGTACTCTT GCAACTACTA ATATTGATAA ATATATGAGT
ACAGCACAAG CTGCATCTAG CCATGGATTT ATGGATACAT TTATAAATAT TGTTCCAACT
AATATTTTTG AATCCCTTGC AAAAGGAGAT TTGCTTCCGA TTATTTTCTT TTCAGTTATG
TTCGGATTAG GTGTAGCTGC AATTGGAGAA AAAGGGAAAC CAGTTCTTTC ACTATGTCAA
GGTATTGCTG ACTCAATGTT TTGGATTACT AATCAAATTA TGAAACTTGC GCCACTTGGC
GTATTTGGAT TAATAGGTGT AACTGTTTCT AAATTTGGAT TAGCTTCATT AATTCCTTTA
GGAAAGTTAA TAATTACTGT TTATGGCGCC ATGTTCTTCT TTGTATTTTT TGTTCTTGGC
TTTATTGCAA AAATGGCTGG AACAAGCATT ATATCACTTA TGAAACTTTT AAAAGATGAA
CTTATTTTAG CTTATACTAC AGCAAGTTCT GAAGCCGTTT TACCAAAACT TATGGAAAAG
ATGGAGAGGT TTGGCTGTCC TAAGGCAATT ACATCTTTTG TTATTCCAAC AGGATATTCA
TTTAACTTAG ATGGATCTAC TTTATATCAA TCTATTGCAG CACTTTTTAT TGCTCAAATA
TATGGAATTC ACTTACCACT TTCTGCTCAA ATTAATTTAG TGCTTGTATT AATGCTTACT
TCAAAAGGTA TGGCTGGAGT TCCTGGTGCA TCTTTTGTAG TACTTTTAGC AACTGTTGGT
TCTTTGGGAA TTCCAGTAGC AGGGGTTGCA TTTATTGCTG GTATAGATCG TATAGTTGAT
ATGGCGAGAA CTCTTGTTAA TGTACTTGGA AATTCCTTAG CTGTTGTTGT TATATCTAAA
TGGGAAAAGG AATTTAATGC TGAAGAAGGA GAAAAATATA TTAAATCAGT TAGTGAAATA
GCATAA
 
Protein sequence
MKKLGLAFQI ILGLILGIII GAIFYGNPVV TSYLQPFGDI FIRLIKMIVI PIVFSSLVVG 
VAGVGDVKKL GKIGGKTILY FEIVTTFAII IGLVIANLFH PGSGVNISTL ATTNIDKYMS
TAQAASSHGF MDTFINIVPT NIFESLAKGD LLPIIFFSVM FGLGVAAIGE KGKPVLSLCQ
GIADSMFWIT NQIMKLAPLG VFGLIGVTVS KFGLASLIPL GKLIITVYGA MFFFVFFVLG
FIAKMAGTSI ISLMKLLKDE LILAYTTASS EAVLPKLMEK MERFGCPKAI TSFVIPTGYS
FNLDGSTLYQ SIAALFIAQI YGIHLPLSAQ INLVLVLMLT SKGMAGVPGA SFVVLLATVG
SLGIPVAGVA FIAGIDRIVD MARTLVNVLG NSLAVVVISK WEKEFNAEEG EKYIKSVSEI
A