Gene VC0395_A2842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2842 
SymbolarcA 
ID5136427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2995025 
End bp2996248 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content51% 
IMG OID640534286 
Productarginine deiminase 
Protein accessionYP_001218692 
Protein GI147675433 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2235] Arginine deiminase 
TIGRFAM ID[TIGR01078] arginine deiminase 


Plasmid Coverage information

Num covering plasmid clones67 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTT TGTATGTGGG CTCTGAAGTG GGGCAACTGC GCCGTGTATT GCTAAATCGT 
CCGGAGCGGG CACTCACCCA TTTAACCCCC TCTAACTGCC ATGAACTCTT GTTTGATGAT
GTACTGGCCG TTGAAGCGGC GGGCGTTGAG CACGATGCCT TTGCCAATAC GTTGCGCACG
CAAGATGTGG AAGTGTTACT GCTGCACGAT TTACTCGAAG AGACACTCGC CATTCCTGAA
GCCAGACAGT GGCTACTGAA CACTCAGATC AGCGATTTTC GCTTTGGTCC AACCTTTGCT
CGCGAGCTGC GCCACGCTCT AAACCATCTT GATGATCATC ATCTCACCAC GCTTTTACTG
GGCGGGCTCG CTTTTTCTGA ATTGCACCTC GAATCGGATT CCATGCTACC GAAAATGCGC
CAGCCACTCG ATTTTGTGAT TGAGCCGCTG CCCAATCACC TGTTTACTCG TGATACCTCC
TGCTGGGTGT ATGGAGGGGT GTCACTCAAT CCAATGATGA AACCGGCTCG CCAGCGTGAA
ACCAACCATT TGCGCGCGAT TTATCGCTGG CACCCGATTT TTGCCCAGCA TCCTTTTATC
CACTATTTCG GTATTGATGA TCTGCACTAC GACAACGCCA ATATAGAGGG TGGTGATGTG
CTGGTGATCG GCAAAGGCGC GGTGTTGATT GGAATGTCTG AACGCACTTC ACCACAAGGA
GTGGAAAATT TAGCGGCCGC ACTCTTTAAA CATGGTCAAG CCAGCAAAGT GATTGCGATC
AACCTGCCCA AACATCGCTC GTGCATGCAT TTAGATACGG TGATGACTCA TATGGATGTG
GATACTTTTT CCGTTTATCC AGAGGTCATG CGTAAAGATC TGCCGACTTG GCGACTCACC
CCCAAGGGCA ATAACGGCGA TATGCGCGTT GAGCAAGTCC CCAGCTATTT ACACGCCATT
GAGCAAGCAC TTGGGGTCGA TTATTTGAAA ATCATCACCA CAGGTGGCAA CAGTTACGAA
GCCGAACGCG AGCAGTGGAA TGACGCCAAT AATGTCCTCA CTGTCAAACC TGGGGTAGTG
ATTGGCTACG AACGCAACGT TTATACCAAT GAGAAATACG ATAAAGCAGG CATTAAGGTA
CTGACCATTC CCGGCAATGA GCTAGGACGA GGCCGCGGCG GCGCACGCTG TATGAGCTGC
CCAATTGAGC GTGACGGGAT TTAA
 
Protein sequence
MNRLYVGSEV GQLRRVLLNR PERALTHLTP SNCHELLFDD VLAVEAAGVE HDAFANTLRT 
QDVEVLLLHD LLEETLAIPE ARQWLLNTQI SDFRFGPTFA RELRHALNHL DDHHLTTLLL
GGLAFSELHL ESDSMLPKMR QPLDFVIEPL PNHLFTRDTS CWVYGGVSLN PMMKPARQRE
TNHLRAIYRW HPIFAQHPFI HYFGIDDLHY DNANIEGGDV LVIGKGAVLI GMSERTSPQG
VENLAAALFK HGQASKVIAI NLPKHRSCMH LDTVMTHMDV DTFSVYPEVM RKDLPTWRLT
PKGNNGDMRV EQVPSYLHAI EQALGVDYLK IITTGGNSYE AEREQWNDAN NVLTVKPGVV
IGYERNVYTN EKYDKAGIKV LTIPGNELGR GRGGARCMSC PIERDGI