Gene CPF_0756 details

Gene Information

Locus tag: CPF_0756
Symbol:
ID: 4203491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 897410
End bp: 898675
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 32%
IMG OID: 638081640
Product: proton/sodium-glutamate symporter
Protein accession: YP_695207
Protein GI: 110799556
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
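
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 897410
end_bp = 898675
gene_length_bp = 1266
protein_length_aa = 421


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def cds_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = cds_protein_length(computed_length)
print(computed_length, computed_protein)  # 1266 421
```

Under these assumptions, both computed values agree with the listed gene length and protein length.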


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0642378
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TAGGACTAGC ATTTCAAATT GTACTTGGAC TTATACTAGG TATTATAGTA 
GGAGCTGTTT TTTATGGTAA TCCAGTTGTT ACTTCATATT TACAACCATT TGGAGATATT
TTTATAAGAT TAATTAAAAT GATTGTCATA CCTATTGTCT TTTCATCACT TGTTGTTGGT
GTTGCTGGAG TTGGAGATGT TAAAAAATTA GGGAAAATCG GCGGAAAAAC AATACTTTAC
TTTGAGATTG TAACTACATT CGCTATTATA ATAGGTTTAG TTGTAGCTAA TTTATTTCAT
CCAGGAAGCG GAGTAAACAT TAGTACTCTT GCAACTACTA ATATTGATAA ATATATGAGT
ACAGCAGAAG CTGCATCTAA CCATGGATTT ATGGATACAT TTATAAATAT TGTTCCAACT
AATATTTTTG AATCTCTTGC AAAAGGAGAT TTACTTCCTA TTATTTTCTT TTCAGTTATG
TTTGGATTAG GTGTAGCTGC AATTGGAGAA AAAGGAAAAC CTGTCCTTGC AATATGTCAA
GGTATAGCTG ATTCAATGTT TTGGATTACT AATCAAATTA TGAAGCTTGC GCCACTTGGC
GTATTTGGAT TAATAGGTGT AACTGTTTCT AAATTTGGAT TAGCTTCATT AATTCCTTTA
GGAAAGTTAA TAATTACTGT TTATGGCGCC ATGTTCTTCT TTGTATTTTT TGTTCTTGGC
TTTATTGCAA AAATATCTGG AACAAGCATT ATATCACTTA TAAAGCTTTT AAAAGATGAA
CTTATTTTAG CTTATACTAC AGCAAGTTCT GAAGCCGTTT TACCAAAGCT TATGGAAAAG
ATGGAGAGGT TTGGCTGTCC TAAGGCAATT ACATCTTTTG TTATTCCAAC AGGATATTCA
TTTAACTTAG ATGGATCTAC TTTATATCAA TCTATTGCAG CACTTTTTAT TGCTCAAATA
TATGGAATTC ACTTACCACT TTCTGCTCAA ATTAATTTAG TGCTTGTATT AATGCTTACT
TCAAAAGGTA TGGCTGGAGT TCCTGGTGCA TCTTTTGTAG TACTTTTAGC AACTGTTGGT
TCTTTAGGAA TTCCAGTAGC AGGAGTTGCC TTTATTGCTG GTATAGATCG TATCGTTGAT
ATGGCGAGAA CTCTTGTTAA TGTACTTGGA AACTCATTAG CTGTTGTTGT TATATCTAAA
TGGGAAAAGG AATTTAATGC TGAAGAAGGA CAAAAATATA TTAAATCAGT TAGTGAAATA
GCATAA
 
Protein sequence
MKKLGLAFQI VLGLILGIIV GAVFYGNPVV TSYLQPFGDI FIRLIKMIVI PIVFSSLVVG 
VAGVGDVKKL GKIGGKTILY FEIVTTFAII IGLVVANLFH PGSGVNISTL ATTNIDKYMS
TAEAASNHGF MDTFINIVPT NIFESLAKGD LLPIIFFSVM FGLGVAAIGE KGKPVLAICQ
GIADSMFWIT NQIMKLAPLG VFGLIGVTVS KFGLASLIPL GKLIITVYGA MFFFVFFVLG
FIAKISGTSI ISLIKLLKDE LILAYTTASS EAVLPKLMEK MERFGCPKAI TSFVIPTGYS
FNLDGSTLYQ SIAALFIAQI YGIHLPLSAQ INLVLVLMLT SKGMAGVPGA SFVVLLATVG
SLGIPVAGVA FIAGIDRIVD MARTLVNVLG NSLAVVVISK WEKEFNAEEG QKYIKSVSEI
A
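
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons. The following is a minimal sketch, not the pipeline used to generate this record: the helper names `translate` and `gc_percent` are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). Only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGAAAAAAT TAGGACTAGC ATTTCAAATT GTACTTGGAC TTATACTAGG TATTATAGTA"
# Last row of the gene sequence above (final codon plus stop codon).
last_row = "GCATAA"

print(translate(first_row))  # MKKLGLAFQIVLGLILGIIV
print(translate(last_row))   # A
```

The two translated fragments match the first 20 residues and the final residue of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give the value to compare against the listed GC content.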