Gene Aave_4678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAave_4678 
Symbol 
ID4669549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax citrulli AAC00-1 
KingdomBacteria 
Replicon accessionNC_008752 
Strand
Start bp5221698 
End bp5223368 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content44% 
IMG OID639825874 
ProductO-antigen polymerase 
Protein accessionYP_972987 
Protein GI120613309 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGCCTA TGGAGTTATC ACTTGCCCTA ACAGGTTTGG CTTGGTTGGT GCCTAATCAC 
TATGGGCCTT GGTCTTCTGC TTGGAATGAG GCTCTAGCCA TATTTGGGTG TATTTTTGCC
TTAATAAGCA TACTGATTCG GGGCTATCCA TGGGGTGTTT CAAGAGTGCT GATAACTTTA
TCCGCGACTT GTTTTGCTTC CATCGGTTTT CAAGTTTTCA CTGGAAAATT GCTATTCTTA
GGCGATGGCC TGATGGCGGC ATTTTATGTG TGTATATGGT TGTCTGCTAG CGCCATAGGC
GCCGCTATTT ATTTTAGTGA AAAAAAACAG GAGGCTATAA GTGTTCTGGC CTTTATTTGG
CTAGCTGGGG CAATTCTGTC AATAGCCGTT GCATTGGTTC AGTGGACGGG AGCATTCAGC
CTTGGCATCT ATGGTGTGGA TCTTCCGCCA GGCGCGCGCC CATTTGGAAA CGTTGCGCAA
CCTAATCATC TCTGCACAAT AGCTTTTATG GGAATTTGTG GAGCTGTATG GTTGTACCAG
AGACGTTTGA TAGGGATTTC ATCCTTTTGG TTGGCGGTTA TCTGTTTGAT CTTTGGAATG
GTTTTATCTC AATCACGCAC TGGTTGGTTG CAGATTGCTT GGTTTGGATT ATTTTTGATC
GCAATCCAAG GTTGGGTAAA TATTAGGATA AGAAGGTTGG AAGTATTTTT TATAATTGTT
TTCTACTTTT TTTTGGTATC GAATTTCGAT AAGCTGTCTG CTGGGCTGCT TATCCCTAGT
GCCCGCTCTA TTTCAGATCA AGTTCAGCCA GGACTGCGCA TTCCTTATTG GCTTTCCATG
CTTGATGCTA TTTCTAGAGA GCCGCTTTGG GGTTATGGTT GGCTGCAAAT AGGTGTGGCT
CAACAGACCG TCGCCTTAAG CTATTCGGGA TTCGGCTCAC TTTATGAACA TGCTCACAAT
TTTGTTCTGG ATGTAATTCT CTGGAACGGC CTGCTGTTTG GTGGATTTAT ATTAGCTTTG
GCGTTGTTGT GGATTTGGCA GGTGAGATCG GCTGCTTTAG ATCGTTCCAT ATTTTGGCTT
GCTATGGCAC TGGGCGGTAT ATTTATACAT GGAATGTTAG AGTTTCCTTT GGAGTACTCA
TATTTCCTGA TACCTGTTGG TTTGATTATG GGCGCAATGA GCGCTTCCCA GTCTGAAAAC
GGCCAATTGA TATTTTTTGC GAGGAACGTG GCTCTCTTGT TTTTGGTGGG ACTGTCAATC
GCTTTTTTTC TGATAGTAAA AGATTATTTT ACGGCAGAAG AGAATTATCG CCAATTACGT
ATAGAGTCCG CTAGAATTGG TACCGATAGA ATCACTTCGC CGGCACCAGA TCTTTTGCTT
CTAAATCAAC TCGAGGCATT TTTGAGGTTT GCTCGAATCG AAGCGAAGCC TGGCATGTCG
GCCAGTGAAT TGGATTTTAT GAAGGCGGTC GCGGAACGAT ATGGGTATCC CCCCGTACTT
TTCCGTTATG CATTGGCTTT GGGCATCAAC GGAAAAGGAG AAGTTGCTCG GGATATTCTG
GAGAAGATAT GCAGAATCCA TGAAAATCAA CGTTGCGAAG AGGCCGTCGA AGGGTGGAGA
GTGATGGAAA ATAAATATCC AGAACTCAAA GTTCTTCGCT GGAGGAGTTA G
 
Protein sequence
MPPMELSLAL TGLAWLVPNH YGPWSSAWNE ALAIFGCIFA LISILIRGYP WGVSRVLITL 
SATCFASIGF QVFTGKLLFL GDGLMAAFYV CIWLSASAIG AAIYFSEKKQ EAISVLAFIW
LAGAILSIAV ALVQWTGAFS LGIYGVDLPP GARPFGNVAQ PNHLCTIAFM GICGAVWLYQ
RRLIGISSFW LAVICLIFGM VLSQSRTGWL QIAWFGLFLI AIQGWVNIRI RRLEVFFIIV
FYFFLVSNFD KLSAGLLIPS ARSISDQVQP GLRIPYWLSM LDAISREPLW GYGWLQIGVA
QQTVALSYSG FGSLYEHAHN FVLDVILWNG LLFGGFILAL ALLWIWQVRS AALDRSIFWL
AMALGGIFIH GMLEFPLEYS YFLIPVGLIM GAMSASQSEN GQLIFFARNV ALLFLVGLSI
AFFLIVKDYF TAEENYRQLR IESARIGTDR ITSPAPDLLL LNQLEAFLRF ARIEAKPGMS
ASELDFMKAV AERYGYPPVL FRYALALGIN GKGEVARDIL EKICRIHENQ RCEEAVEGWR
VMENKYPELK VLRWRS