Gene VC0395_A1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1943 
SymbolthrA 
ID5135687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2073083 
End bp2075542 
Gene Length2460 bp 
Protein Length819 aa 
Translation table11 
GC content50% 
IMG OID640533400 
Productbifunctional aspartokinase I/homeserine dehydrogenase I 
Protein accessionYP_001217867 
Protein GI229259765 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase
[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0938577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGTAT TAAAGTTTGG AGGTTCATCG CTGGCGGATC CGGAGCGTTT TCTACGCGCT 
GCCGATATCA TCGCGAATAA CGCACAGCAG GAAGAGGTGG CCGTTGTCCT CTCTGCTCCG
GGAAAAACCA CCAATAAATT GGTGGCAGTG ATTGAAAGTG CTTTACGTCA GGGAGAGGTG
GAAGCGCAAA TCGTAGAGCT AGAGAGCTCA TTCTATGCCT TGCTTGATGG CATTAAAGCG
CAGCTCCCGA ATTTGGATGA CAGTGCTTAT CAACAGCAAG TGCACTCTTC CATGACTCAG
TTACGCCAGT TTGTGCACGG CATTACGTTG CTCGGTATGT GTCCAGACAA TGTGAATGCA
CGCATTATCA GTAAAGGTGA ACGCGTTTCC ATTCAATTGA TGAAAGCGGT GATGGAAGCC
AAAGGTCTGC CAGCGAATTT GGTTGATCCC GTTAAATACC TGCTTGCCAA AGGGGATCAT
CTTGAAGCTA TGGTCGATGT GGAAATTTCG ACTCAGCGTT TCAGACAAGC ACCATTGCCA
CAACAGCACG TCAATATCAT GCCGGGCTTT ACTGCGGGTA ATGCACAGGG TGAGTTGGTC
TGTTTAGGGC GTAACGGTTC GGATTACTCA GCCGCGGTAC TGGCTGCCTG TTTACGTGCG
GATTGTTGTG AAATCTGGAC GGATGTGGAT GGAGTTTATA ACTGCGATCC GCGTTTGGTC
GATGATGCGC GCCTTCTTAA ATCCCTTAGC TATCAAGAGG CGATGGAGCT TTCTTACTTC
GGAGCATCTG TGTTGCATCC GAAGACCATT GCACCGATTG CTCAATTTCA AATCCCTTGT
TTGATCAAAA ACAGCTTCAA TCCACAAGGT GCGGGCACCT TGATTGGCCA AGACACTGGC
GAAGATAAAC TCGCGATCAA AGGCATTACC ACTCTGAGCA ATCTCACTAT GGTCAACGTT
TCTGGTCCAG GAATGAAAGG CATGGTGGGG ATGGCGAGCC GCGTGTTTGG CGCGATGTCA
GCGGCCGATG TGTCGATCGT GCTTATCACT CAATCTTCTT CGGAATACAG CATCAGTTTC
TGTATCGAAG CACAACATAA AGCTTTGGCA CAGCAAGCGC TGGCGGATGC ATTTGAACTG
GAACTCAAAG ATGGCCTGCT TGAGCCGGTT GAGTTTGTCG ATAACGTCGC CATCATCACG
CTGGTTGGTG ATGGTATGCG CACCTCTCGC GGTGTCGCGT CGCAATTTTT CTCATCTCTG
GCCGAAGTAC ACGTCAACGT GATTGCAATT GCCCAAGGCT CTTCTGAGCG TGCGATTTCA
GCGGTGATCC CCGATGATAA GATTTCAGAA GCGATCAAGG CTTGCCACGA AAACCTCTTC
AACTCTAAAC ACTTTTTAGA TGTGTTTGTG GTCGGTGTTG GTGGCGTGGG TGGTGAGTTG
GTGGATCAAA TTCAGCGTCA ACAAGCCAAG CTTGCGGAAA AAGGCATCAT GATGCGCGTT
TGTGGCCTCG CTAACAGTAA AGGCATGCTG CTTGATAGCC AAGGCTTGCC GTTGGAGCAG
TGGCGCGATC GTATGGTCAA TGCGGATCAA GCGTTCAGTT TAGAGAATCT GGTGGCTTTG
GTGCAGCGTA ATCACATCAT TAACCCAGTA TTGGTGGATT GCACGTCCAG TGATGAGATT
GCTAACCAGT ATGCGGATTT CCTCGCCGCA GGTTTCCATG TGGTGACCCC GAACAAGAAA
GCCAATACCG CTAGCATGAG TTATTACCAT CAGCTACGTA ATGTGGCGCG TCACTCGCGT
CGTAAACTGA TGTATGAAAC TACAGTTGGC GCGGGTCTGC CCGTTATCGA AAACTTGCAA
AACCTAATTG CAGCAGGGGA TGAGCTGGAA AAATTCAGCG GTATTCTCTC AGGTTCTCTC
TCCTTTATCT TTGGTAAATT GGATGAAGGC ATGACCTTGA GCCAAGCAAC GCAGCTTGCT
AAAGAGAAAG GCTTTACAGA GCCAGATCCG CGCGATGACC TCTCTGGTAT GGATGTGGCG
CGTAAGTTAC TGATTTTGGC GCGTGAAGCG GGTTACGAGT TGGAGCTGAG CGATGTGGAT
GTTGAACAAG CTCTGCCCGC TGGCTTTGAT GCTTCAGGCA GTGTTGAGGA GTTCATGGCT
CGTTTAGCGC AAGCCGATGC CGCGTTTGCT GAGCGAGTAG CGCAAGCCAA AGCCGAAGGT
AAAGTGCTGC GCTATGTGGC GCAAATCGTC GATGGCCAGT GCCAAGTGCG GATTGTCGCG
GTTGATGAAA ACGATCCTAT GTTCAAAGTC AAAGAAGGTG AAAACGCGCT GGCTTTCTAC
AGCCGTTACT ATCAGCCAAT CCCATTGGTA TTACGTGGTT ACGGCGCGGG TTCTGAAGTG
ACTGCTGCAG GCGTGTTCTC AGATGTGATG CGTACACTCG GTTGGAAATT AGGGGTTTAA
 
Protein sequence
MRVLKFGGSS LADPERFLRA ADIIANNAQQ EEVAVVLSAP GKTTNKLVAV IESALRQGEV 
EAQIVELESS FYALLDGIKA QLPNLDDSAY QQQVHSSMTQ LRQFVHGITL LGMCPDNVNA
RIISKGERVS IQLMKAVMEA KGLPANLVDP VKYLLAKGDH LEAMVDVEIS TQRFRQAPLP
QQHVNIMPGF TAGNAQGELV CLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYNCDPRLV
DDARLLKSLS YQEAMELSYF GASVLHPKTI APIAQFQIPC LIKNSFNPQG AGTLIGQDTG
EDKLAIKGIT TLSNLTMVNV SGPGMKGMVG MASRVFGAMS AADVSIVLIT QSSSEYSISF
CIEAQHKALA QQALADAFEL ELKDGLLEPV EFVDNVAIIT LVGDGMRTSR GVASQFFSSL
AEVHVNVIAI AQGSSERAIS AVIPDDKISE AIKACHENLF NSKHFLDVFV VGVGGVGGEL
VDQIQRQQAK LAEKGIMMRV CGLANSKGML LDSQGLPLEQ WRDRMVNADQ AFSLENLVAL
VQRNHIINPV LVDCTSSDEI ANQYADFLAA GFHVVTPNKK ANTASMSYYH QLRNVARHSR
RKLMYETTVG AGLPVIENLQ NLIAAGDELE KFSGILSGSL SFIFGKLDEG MTLSQATQLA
KEKGFTEPDP RDDLSGMDVA RKLLILAREA GYELELSDVD VEQALPAGFD ASGSVEEFMA
RLAQADAAFA ERVAQAKAEG KVLRYVAQIV DGQCQVRIVA VDENDPMFKV KEGENALAFY
SRYYQPIPLV LRGYGAGSEV TAAGVFSDVM RTLGWKLGV