Gene Aave_3478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAave_3478 
Symbol 
ID4665504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax citrulli AAC00-1 
KingdomBacteria 
Replicon accessionNC_008752 
Strand
Start bp3838572 
End bp3840233 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content67% 
IMG OID639824673 
ProductYD repeat-containing protein 
Protein accessionYP_971808 
Protein GI120612130 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.982216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGCA TGCTGCAGGC CGGCAATGGC GATGCCCGCG TACACTTCAC CTACGACCCG 
CTCTCGCGCC TGACCGAAGA GGTGCAGGAG CATCTGGCGC CGGAAGGGGC GGGCACCCTG
GGCCGCTATG CCTGGCGCCA TGCCTACGAC GCGCTGGGCA ACCGCACGGA CAGCCGCATG
CCCGGCGGCC GGCACGTGCA ATGGCTCTAC TACGGCTCGG GCCACCTGCA CCAGATCCGG
GTGGACGGGC ACGTGCTCTC GGACATCGAG CGCGACGCGC TGCACCAGGA GATCGAGCGC
ACGCAGGGTG CGCTGGAGAG CCGCTACGGC TGGGATCCGA TGGGCCGGCT GGTGGCGCAC
AAGGTGGGCC GGCGGCAGGC GCTGCAGGGC CAGCCGGGCA TGCCGTCGCC ACAGGCCCTG
CGGGACCGGG ACCAGGGCCT GCCCGCATTG CCGCAGCTTC CTTCGGGCGA CCGCATCGCG
CGGCAGTACC GCTACGACCC GACGGGCCAC CTGATCGCCA CGCGCGACGG CCTGCGGGGC
GAGAGCCACT ACCGCTACGA CCCGCTGGGC CGCATCCTGG CGGCGCAGCG CGGGGAAGGC
AAACAACAGC CCACCGAGCG CGAGACCTTC GCCTTCGACC CGGCCGGCAA CCTGCTGAAC
CCGAACCGGG GCGGCGCGCA ATCCAGCGGC GGCGTGGGCC AGCGGGACGT GGTGCCATAC
AACCGGCTGG CGGTGTACCA GGACCTGCGC TTCACCTACG ACCTGCACGG CAACACCATC
GAGCGCCGCA TCGGCTGGCA CACGGTGCAG CACTACCGCT ACAGCCCGGA GCACCAGATC
GTGGAGGCGC GCGTGGTGCG GTATCGCGAA CGGCCCGCCG AAGGGCAAGC GGAGCCTGCG
GCCACGGAGC AGGTCACCCA CTACCGCTAC GACGCCCTGG GCCGGCGCAT CGACAAGCGG
GACGCGTTCG GGCGCACGGT GTTTCTCTAC GACGGGGACC TGCTGGCGGG CGAACTGCGC
GGCAGCAGGC TGTCGGAATA CCTCTACGAG CCGGACAGCT TCGTGCCGCT GGCGAAGCTG
GAATCGGAGT GGAAGGGTGA GGCCGCAGAC AAAGAAAGGG ATGAGGACAA GGAGTCGGCA
CGACCCAAGG ACTTCGCTGC CTACTACTAC CAATGCGACC AGATCGGCGC GCCGCAGGAG
CTGACGGACG AGCAGGGCCG CATCGTGTGG GCGGCGAGCT ACCAGGTGTG GGGTCAGACG
CGGGCGCTGC AAGTCATGCG CACGGGCACG GACGACGCGG CGGTGTTCAC CCAGGCGGAG
CGGCCTTTGG CGCTGGCGGC GAAGGGGGAT GTGCAGGCGC TGAGCTTCGT GGAGCAACCG
CTGCGGTTCC AGGGGCAGTA CTTCGATGGC GAGACGGGAC TGCACTACAA CCGGTTTCGG
TACTACGATC CGGTGACGGG GCGGTTCGTG CATCAGGATC CGATTGGGTT GTTGGGTGGT
ACCAATCTTT TTACCTATGC TCCAAATGCA TTTAATTGGG TGGATGAGTA TGGATTGCAA
AGAAAGAACA AAAGTTGCTC AATTTGTGGT TCGCAAAGAT GTAAGACGCT GGCAGATTGG
CTGAAGGATT ATCCGGAAAT ATTGAAAGAG GCCAGAGAAT AA
 
Protein sequence
MGRMLQAGNG DARVHFTYDP LSRLTEEVQE HLAPEGAGTL GRYAWRHAYD ALGNRTDSRM 
PGGRHVQWLY YGSGHLHQIR VDGHVLSDIE RDALHQEIER TQGALESRYG WDPMGRLVAH
KVGRRQALQG QPGMPSPQAL RDRDQGLPAL PQLPSGDRIA RQYRYDPTGH LIATRDGLRG
ESHYRYDPLG RILAAQRGEG KQQPTERETF AFDPAGNLLN PNRGGAQSSG GVGQRDVVPY
NRLAVYQDLR FTYDLHGNTI ERRIGWHTVQ HYRYSPEHQI VEARVVRYRE RPAEGQAEPA
ATEQVTHYRY DALGRRIDKR DAFGRTVFLY DGDLLAGELR GSRLSEYLYE PDSFVPLAKL
ESEWKGEAAD KERDEDKESA RPKDFAAYYY QCDQIGAPQE LTDEQGRIVW AASYQVWGQT
RALQVMRTGT DDAAVFTQAE RPLALAAKGD VQALSFVEQP LRFQGQYFDG ETGLHYNRFR
YYDPVTGRFV HQDPIGLLGG TNLFTYAPNA FNWVDEYGLQ RKNKSCSICG SQRCKTLADW
LKDYPEILKE ARE