Gene Aave_3086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAave_3086 
Symbol 
ID4665689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax citrulli AAC00-1 
KingdomBacteria 
Replicon accessionNC_008752 
Strand
Start bp3402278 
End bp3404287 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content68% 
IMG OID639824285 
Productpeptidase U32 
Protein accessionYP_971424 
Protein GI120611746 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.193528 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.252382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTCC TGCCCCACCA GCTCGAACTT CTCTCCCCCG CCCGCGACGC CGACATCGGC 
ATCGAGGCGG TCAACCATGG CGCGGACGCG GTCTATATCG GCGGACCGGC CTTCGGCGCG
CGGGCCAGCG CGGGCAACGA ACTGCGCGAC CTGGAGCGGC TCATCCGCCA TGCGCACCGC
TTCCACAGCC GCATCTTCGT CACGCTCAAC ACCATCCTGC GCGACGACGA ACTCGAAGGC
GCCCGCCGCA TGGCCTGGCA GGTGTACGAG GCCGGCGCCG ACGCGCTCAT CATCCAGGAC
ATGGGCCTGC TGGAAATCGA CCTGCCCCCC ATCCAGCTGC ACGCGAGCAC ACAGACGGAC
ATCCGCACGC CCGAGAAGGC TCGCTTCCTG CAGGACGCCG GGCTGTCGCA GATCGTGCTG
GCGCGCGAGC TCACGGTGCA GGAGATCGCG GCCATCCGCG CTGCGACCGA CCCGGAGCGC
TGCACGATCG AGTTCTTCAT CCACGGCGCA CTCTGCGTGG CCTACAGCGG CCAGTGCTAC
ATCAGCCACG CGCATACCGG CCGCAGCGCC AACCGCGGCG ACTGCAGCCA GGCCTGCCGC
CTGCCCTACC AGGTGACGGA CGACGCCGGC CGCTTCATCG CGCACGACAA GCACGTGCTC
TCGATGAAGG ACAACAACCA GTCGGCCAAC CTGCGCCCCC TCATCGACGC GGGGGTGCGC
AGCTTCAAGA TCGAAGGGCG GTACAAGGAC ATGGCCTACG TGAAGAACGT CACGGCCCAC
TACCGCCGGC TGCTCGACGA GATCATCGAG GAGCGCGAGA CCTCCGCCGC GCCCCTGGCC
CGCTCTTCCA GCGGCCAGAC GCGCTTCACC TTCACGCCCG ACCCGAACCA GAACTTCAAC
CGCGAGTTCA CCGACTATTT CGTGAACGGC CGCAAGGAAG ACATCGGCGC GTTCGACACG
CCCAAGAACC CCGGCCAGGC GATCGGCTGG GTCACGCAGG TGGGCCCGAA CTGGGTCGAG
CTGGAAACCC ATGCGCCCGA CACCGTGCTG CACAACGGCG ACGGCTTGTG CTACTGGGAC
CTGCAGAAGG AACTCGTGGG CGTGGCGATC AACCGCGCCG AGGCGGCGCC CGGCAAGTCA
CGCAACCACT GGCGCGTGTT CCCGAAGGAC CCGATGGAGG GTTTCAGGGA CCTGCGCCGC
GGCACCGAGA TCAACCGCAA CCGGGACATG GACTGGGTGC GCACGCTCGA GAAGAAATCC
AGCGAGCGGC GCATCGGCCT GTGGGCGCAT TTCACGGACA CCGACGCCGG CTTCGCCCTC
ACGCTGACCG ACGAGGACGG CTTCACCGGC ACCGCCGAGG TGGCGCATGC GCACGAGCCG
GCCACCGATG CGGGGCGCGC GGAAGCCGCC CTGCGCGAAC AGCTCGGCCG CTTCGGCGCC
ACGATCTTCC ACGCGCACGA CATCGCCGTC GCGATGCGGC AGCCGTGGTT CGTGCCCGCC
TCCGTGCTGA ACCCGCTGCG CCGCGATGCC GTGGCCGCCC TGGAGGCGGC GCGCACCGAG
GGCCTGCGGC GCCTGCCCCG CGCGCAACCC GTGGAGCCGC CGGCGCCCTT CCCCGAGGAC
ACCCTCACCT ACCTGGCCAA CGTGTTCAAC CAGAAGGCGC ATGACTTCTA CATGAAGCAC
GGCGTGAAGG TGATCGACGC CGCCTACGAA AGCCAGGAAG AGGACGGCGA GGTGAGCCTG
ATGATCACCA AGCACTGCGT GCGCTTTTCC ATGAGCCTGT GCCCCAAGCA GGCCAAGGGC
GTGATCGGGG TCAAGGGCAC CATCAAGGCC GAACCCCTGC ACCTGATCAA CGGCAAGGAA
AAGCTCACGC TGCGCTTCGA CTGCAAGCCC TGCGAGATGC ACGTGGTGGG CCGGATCAAG
AAATCGGTGC AGAACCAGCA GGCGCGCGAG GCGCAGGCGG TACCGATGCA GTTCTACCGC
ACGCGGCCCG TGCCCGGCGC GCCGCACTGA
 
Protein sequence
MSLLPHQLEL LSPARDADIG IEAVNHGADA VYIGGPAFGA RASAGNELRD LERLIRHAHR 
FHSRIFVTLN TILRDDELEG ARRMAWQVYE AGADALIIQD MGLLEIDLPP IQLHASTQTD
IRTPEKARFL QDAGLSQIVL ARELTVQEIA AIRAATDPER CTIEFFIHGA LCVAYSGQCY
ISHAHTGRSA NRGDCSQACR LPYQVTDDAG RFIAHDKHVL SMKDNNQSAN LRPLIDAGVR
SFKIEGRYKD MAYVKNVTAH YRRLLDEIIE ERETSAAPLA RSSSGQTRFT FTPDPNQNFN
REFTDYFVNG RKEDIGAFDT PKNPGQAIGW VTQVGPNWVE LETHAPDTVL HNGDGLCYWD
LQKELVGVAI NRAEAAPGKS RNHWRVFPKD PMEGFRDLRR GTEINRNRDM DWVRTLEKKS
SERRIGLWAH FTDTDAGFAL TLTDEDGFTG TAEVAHAHEP ATDAGRAEAA LREQLGRFGA
TIFHAHDIAV AMRQPWFVPA SVLNPLRRDA VAALEAARTE GLRRLPRAQP VEPPAPFPED
TLTYLANVFN QKAHDFYMKH GVKVIDAAYE SQEEDGEVSL MITKHCVRFS MSLCPKQAKG
VIGVKGTIKA EPLHLINGKE KLTLRFDCKP CEMHVVGRIK KSVQNQQARE AQAVPMQFYR
TRPVPGAPH