Gene YpAngola_A4067 details


Gene Information

Locus tag: YpAngola_A4067
Symbol: dctA
ID: 5802546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4330259
End bp: 4331548
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 47%
IMG OID: 641341848
Product: C4-dicarboxylate transporter DctA
Protein accession: YP_001608354
Protein GI: 162418786
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
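
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4330259, 4331548)
print(length)                     # 1290
print(protein_length_aa(length))  # 429
```

Under those assumptions both values agree with the Gene Length and Protein Length fields listed in the record.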


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.409819
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTCT CTATTTTTAA AACGCTTTAT TTTCAAGTGC TAACGGCGAT AACTATTGGT 
GTGTTGCTTG GCCATTTCTA CCCGGAAATC GGCGCTCAGA TGAAACCACT GGGTGATGGA
TTTGTTAAAT TAATTAAAAT GATTATCGCG CCCGTCATTT TTTGTACCGT TGTTACTGGT
ATCGCGGGCA TGGAGAGCAT GAAGGCCGTT GGCCGTACTG GTGCGATTGC ACTGCTCTAT
TTTGAAATTG TCAGTACTCT GGCGCTGCTG ATTGGTTTAG TGGTGGTTAA CGTGGCGCAA
CCCGGTGTTG GGATGAACAT CGATCCTGCG ACACTGGATG CTAAAGCGGT TGCACTTTAT
GCTGAGCAAG CCTCACAACA GGGGATTATT CCATTCTTGC TGGATATTAT TCCCGGTAGC
GTAGTTGGCG CATTTGCCAG CGGCAACATT CTGCAGGTCT TACTGTTTGC GGTCTTGTTT
GGTTTTGCTT TACACCGTTT GGGTGAAAAG GGGCAGTTGA TTTTCAATGT TATTGAAAGC
TTCTCCCGTG TTATTTTTGG TGTCATCAAT ATGATCATGC GCTTGGCACC GCTCGGGGCT
TTCGGTGCGA TGGCCTTTAC TATCGGTAAA TATGGTGTAG GAAGTCTGGT GCAATTGGGG
CAATTAATCC TCTGCTTCTA TTTGACCTGT ATTCTGTTTG TCGTTTTGGT ACTTGGTACG
ATTGCGAAAT TCAATGGTTT TAATATCTTC AAATTTATCC GTTATATCAA AGAAGAGCTG
TTGATCGTGT TAGGCACCTC CTCTTCTGAG TCAGTGCTCC CTCGCATGTT GGATAAAATG
GAGAACGCCG GGTGTAAAAA ATCGGTAGTG GGTCTGGTTA TCCCAACGGG CTACTCATTT
AACCTGGATG GCACCTCTAT TTACCTGACG ATGGCCGCGG TATTTATCGC TCAGGCAACT
AACACCCATA TGGATATTAT GCATCAGGTG ACATTGCTGG TGGTGCTGCT GCTCTCCTCA
AAAGGTGCGG CAGGTGTCAC AGGCAGTGGC TTTATCGTGT TGGCCGCCAC CATTTCTGCG
GTTGGACATT TACCTTTGGC TGGGCTGGCA TTGATTCTGG GGATTGACCG CTTTATGTCC
GAGGCTCGCG CCCTGACTAA TCTGGTAGGT AACGGTGTTG CCACCATTGT GGTCGCTAAA
TGGTGTAAGC AACTGGACAA TGACCAACTG CAAGCGGTGC TATCCAATAA AGTGCTGCCT
AATGTAAAAA GCAGTGTTTC TGTGTCCTGA
 
Protein sequence
MKVSIFKTLY FQVLTAITIG VLLGHFYPEI GAQMKPLGDG FVKLIKMIIA PVIFCTVVTG 
IAGMESMKAV GRTGAIALLY FEIVSTLALL IGLVVVNVAQ PGVGMNIDPA TLDAKAVALY
AEQASQQGII PFLLDIIPGS VVGAFASGNI LQVLLFAVLF GFALHRLGEK GQLIFNVIES
FSRVIFGVIN MIMRLAPLGA FGAMAFTIGK YGVGSLVQLG QLILCFYLTC ILFVVLVLGT
IAKFNGFNIF KFIRYIKEEL LIVLGTSSSE SVLPRMLDKM ENAGCKKSVV GLVIPTGYSF
NLDGTSIYLT MAAVFIAQAT NTHMDIMHQV TLLVVLLLSS KGAAGVTGSG FIVLAATISA
VGHLPLAGLA LILGIDRFMS EARALTNLVG NGVATIVVAK WCKQLDNDQL QAVLSNKVLP
NVKSSVSVS
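
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch (not part of the original record), the code below strips that layout and translates DNA to protein. It assumes the standard codon assignments, which translation table 11 shares with the standard code for internal codons. Table 11 differs mainly in which codons may act as start codons, and this gene begins with ATG. The helper names `clean_sequence` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGAAAGTCT CTATTTTTAA AACGCTTTAT TTTCAAGTGC TAACGGCGAT AACTATTGGT"
last_line = "AATGTAAAAA GCAGTGTTTC TGTGTCCTGA"
print(translate(first_line))  # MKVSIFKTLYFQVLTAITIG
print(translate(last_line))   # NVKSSVSVS
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` would be the way to check the complete 429-residue product.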