Gene VC0395_A2217 details

Gene Information

Locus tag: VC0395_A2217
Symbol: argH
ID: 5136099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2366192
End bp: 2367568
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 50%
IMG OID: 640533673
Product: argininosuccinate lyase
Protein accession: YP_001218133
Protein GI: 147673344
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
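
The coordinate and length fields in this table are mutually consistent, which can be confirmed with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2366192, 2367568)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # 1377 458
```

Under these assumptions the result agrees with the Gene Length (1377 bp) and Protein Length (458 aa) fields.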


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTAT GGGGCGGAAG ATTTACCCAA GCGGCAGACA GTCGGTTCAA ATCATTCAAC 
GATTCCTTAC GTTTTGATTA TCGATTGGCT GAGCAAGATA TTGTTGGATC AATCGCTTGG
TCAAAAGCCT TGGTCTCCGT CAATGTCTTG AGCGTGCAGG AGCAACAACA GTTAGAGCAG
GCGTTAAATC ACCTATTGCA ATCGGTTCAG CAAGATCCGG AGCAAATCTT AGCCTCTGAT
GCAGAAGATA TTCACTCTTG GGTCGAGCAG AAGCTGATTG AGCAAGTCGG TGATCTTGGT
AAAAAACTGC ACACAGGGCG TTCACGTAAT GATCAGGTCG CAACCGATCT TAAGCTCTGG
TGTCGCGATC AGGGCGTTCA TCTGCTGCTT GCTCTGAAAA CGCTGCAACA ACAGTTGGTG
GCGGTGGCTG CTGAACATCA ATCGACAGTT TTGCCGGGCT ACACCCATTT GCAACGCGCG
CAACCAGTAA CTTTTACTCA TTGGTGTTTA GCTTATTTGG AGATGTTTGA GCGTGATGAG
TCCCGTTTGA CTGATGCTCT AGCGCGTTTA AACACCTCAC CACTGGGTTC AGGTGCATTA
GCGGGAACCG CTTACGCGAT TGATCGTGAG GTGTTAGCGG CGGATCTTGG TTTCACTCGC
GCAACACGTA ACTCGCTGGA TGCGGTCTCC GATCGCGATC ATGTGATGGA GCTGATGTCA
GTCGCATCGA TCTCTATGCT GCATTTGTCG CGTCTGGCGG AAGATATGAT CTTCTACACC
ACTGGTGAAG CGGGCTTTAT TGAATTGGCT GATACGGTGA CCTCTGGTTC TTCACTGATG
CCACAAAAGA AAAACCCCGA TGCACTTGAA TTGATCCGCG GCAAAACCGG ACGCGTCTAT
GGTGCATTGG CTGGGATGAT GATGACAGTC AAAGCTCTGC CTCTCGCGTA CAACAAAGAC
ATGCAAGAAG ACAAAGAAGG GCTGTTTGAT GCGCTCGACA CTTGGTTTGA TTGCTTGCAA
ATGGCGGGAC TTTGCTTTGA TGGCATTAAA GTCAATGCGG CGCGTACGTT AGAAGCGGCC
AAGCAAGGCT ACTCGAACGC GACTGAATTG GCGGATTATC TGGTTGCGAA AGGCATTCCA
TTTCGCGAGG CGCACCATAT TGTCGGGGTA GCGGTAGTCG CAGCTATTGG CAAGGGTGTA
GCGTTGGAAG AGTTGTGTCT GGCGGAGCTG CAACAGTTTT CGCCTTTGAT TGAGCAGGAT
GTCTATCCGA TCCTGACTAT TGAGTCTTGT TTAGAGAAAC GCTGCGCACT CGGTGGGGTA
TCGCCCAAAC AAGTGGCGCA TGCTCTTCAG CAAGCTCAAG CGCGCGTGAA GTCTTAA
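
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before computing anything such as the GC content field. The sketch below shows one way to do that. The helper names are hypothetical, and only the first line of the sequence is used as input to keep the example short.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGGCATTAT GGGGCGGAAG ATTTACCCAA GCGGCAGACA GTCGGTTCAA ATCATTCAAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence to the same functions gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.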
 
Protein sequence
MALWGGRFTQ AADSRFKSFN DSLRFDYRLA EQDIVGSIAW SKALVSVNVL SVQEQQQLEQ 
ALNHLLQSVQ QDPEQILASD AEDIHSWVEQ KLIEQVGDLG KKLHTGRSRN DQVATDLKLW
CRDQGVHLLL ALKTLQQQLV AVAAEHQSTV LPGYTHLQRA QPVTFTHWCL AYLEMFERDE
SRLTDALARL NTSPLGSGAL AGTAYAIDRE VLAADLGFTR ATRNSLDAVS DRDHVMELMS
VASISMLHLS RLAEDMIFYT TGEAGFIELA DTVTSGSSLM PQKKNPDALE LIRGKTGRVY
GALAGMMMTV KALPLAYNKD MQEDKEGLFD ALDTWFDCLQ MAGLCFDGIK VNAARTLEAA
KQGYSNATEL ADYLVAKGIP FREAHHIVGV AVVAAIGKGV ALEELCLAEL QQFSPLIEQD
VYPILTIESC LEKRCALGGV SPKQVAHALQ QAQARVKS
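
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal, dependency-free example. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons), so one lookup table is enough here. The function name is hypothetical, and the inputs are the first three blocks and the last line of the gene sequence from this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(blocked_dna: str) -> str:
    """Translate in frame 0 up to the first stop codon.

    Whitespace is ignored; unknown codons (for example with N) become 'X'.
    """
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the gene: matches the first protein block, MALWGGRFTQ.
print(translate("ATGGCATTAT GGGGCGGAAG ATTTACCCAA"))
# Last line of the gene (begins at offset 1320, so it is in frame).
print(translate("TCGCCCAAAC AAGTGGCGCA TGCTCTTCAG CAAGCTCAAG CGCGCGTGAA GTCTTAA"))
```

The second call stops at the terminal TAA codon and returns SPKQVAHALQQAQARVKS, which matches the end of the protein sequence listed in this record.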