Gene VC0395_A1377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1377 
SymboldctP-1 
ID5137567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1482096 
End bp1483061 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content45% 
IMG OID640532835 
ProductC4-dicarboxylate-binding periplasmic protein 
Protein accessionYP_001217320 
Protein GI147673037 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACCA TTAATAAGAT TACTATCGCA ATACTTACTT TGAGTGCTGC TGCTTCTGTC 
AATGCTGCGA CGACTTTAAA GATGGGGATG CAAGCTTCTG TGGGGTCTGT AGAGTATAAC
TCGGCAAAAA TGCTTGCCGA CACATTAGAA GAAATGAGTC AAGGAGAGAT CAAACTCGCT
TTGTACCCAA GCGCCCAGCT TGGTGATGAT CGTGCCATGC TTCAGCAATT GACGCTGGGA
GATCTCGATA TAACTTATGC TGAGTTTGGT CGTATGGGGC TTTGGATACC GCGAGCAGAA
GCGGTCATGC TCCCTTATGT CGCGAAAGAT TTTGACCATT TACGCCGCAT GTTTGAATCT
GACTTTGGTC AAGGTGTTCG TGATGAAATG CTCCAGAAGT TCAACTGGCG TGCTTTGGAC
ACTTGGTATA ACGGTACCCG TGAAACCACT TCAAACCGTC CCCTCAATTC GATTGAAGAT
TTTAAAGGGT TAAAACTTCG AGTCCCGAAT GCTAAGCAAA ACCTCAACTA TGCAAAGCTG
TCTGGTGCCT CGCCAACCCC GATGTCATTC TCTGAAGTTT ATTTAGCGCT GCAGACCAAT
GCCGTAGATG GGCAAGAAAA CCCGCTACCA ACAATTAAAA CAATGAAGTT CTATGAAGTG
CAAAAGAACT TAGCCATGAC ACATCATATT GTTAACGATC AAATGGTGAT CATTTCGGAA
TCTACTTGGC AGAAGCTTTC TGATACGGAT AAAGACATCA TTCAGAAAGC CGTGCAGAAA
GTGGGAGATG CTCATACACA GACCGTTAAA ACTCAAGAGG CAGAATTGGT CTCCTTCTTC
AAGAGTGAAG GTATCAACGT GACTTACCCA GATCTGGAGC CATTCCGAGA AGCGATGCAA
CCACTTTACA AGGAGTTTGA CAGTAACATC GGTCAGCCGA TTGTGTCGAA ATTGGCAGCA
ATGTAA
 
Protein sequence
MKTINKITIA ILTLSAAASV NAATTLKMGM QASVGSVEYN SAKMLADTLE EMSQGEIKLA 
LYPSAQLGDD RAMLQQLTLG DLDITYAEFG RMGLWIPRAE AVMLPYVAKD FDHLRRMFES
DFGQGVRDEM LQKFNWRALD TWYNGTRETT SNRPLNSIED FKGLKLRVPN AKQNLNYAKL
SGASPTPMSF SEVYLALQTN AVDGQENPLP TIKTMKFYEV QKNLAMTHHI VNDQMVIISE
STWQKLSDTD KDIIQKAVQK VGDAHTQTVK TQEAELVSFF KSEGINVTYP DLEPFREAMQ
PLYKEFDSNI GQPIVSKLAA M