Gene Aave_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAave_1997 
Symbol 
ID4667244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax citrulli AAC00-1 
KingdomBacteria 
Replicon accessionNC_008752 
Strand
Start bp2169435 
End bp2170517 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content74% 
IMG OID639823208 
Productprotein of unknown function DUF513, hemX 
Protein accessionYP_970355 
Protein GI120610677 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2959] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.178831 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00087964 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTTCCG AGCCCACCCA CGCACCTTCC GCGCCTCCCG CAGGCCCGGC GCCCCAGGCC 
TCCGGCGTGC AGCGCGCCGT GCTGACGCTG CTCGGCGTCG TGGCCGTCGC AGGCCTCGCG
ACCAGCGTCA TGCTGTGGCA GCGCCTCGGC AGCATCCAGG AACAGCTCGC GCGCCAGTCG
GCCGATGCCG GCGCGCAGTC CATCGAGGCC CGCACCCTCG CCAACCAGGC GCTGGACATG
GCGCGCGACG TCTCGGCGCG CATCGCCGTG AACGAAACCC GCGTGAGCGA AGTCGCCCTG
CAACGCAGCC AGCTCGAGGA ACTCATGCAG AGCCTCTCGC GCTCGCGCGA CGAGAACCTG
GTGGTGGACA TCGAATCCGC GCTGCGCCTC GCGCAGCAGC AGGCCCAGCT CACCGGCAGC
CTGGAGCCCG TCATGGCCGC GCTCAAGAGC GCCAGCCAGC GCATCGAGCG GGCCGCGCAG
CCGCGCCTGG CCCCCGTGGC GCGCGCCATC GGCCGCGACC TCGACCGCGT GGGCTCGGCG
CAGGTCACCG ACACGGCCGG CCTGCTGGCC CGCCTGGACG ACCTCATGCG CCAGGTGGAC
GAACTGCCCC TGCAGAACGC CGTGGCCCAG GCCGCGGCCA CGCGGCGCAT GAATGCCGCC
GCCCGCCCGT CCGAGGGCCC GGCCGCGCCC GGAGCCGACG GCGCCCTGCC CTGGTGGCAA
GCCGCGCTGC AGCGCGGCTG GGAAGTCGTG CGCGACGAGG CCCGCCAGCT GCTGCGCGTC
ACCCGCATCG ACCGCCCGGA AGCCATCCTC ATCGCCCCCG ATCAGGCCTT CTTCCTGCGC
GAGAACCTCA AGCTCCAGCT GATGAACGCC CGCCTCGCGC TGCTGGCGCG CCAGTACGAA
TCGGCGCGCG CAGACCTCTC CGCCGCCAAC AACGCCCTGG GCCGGTACTT CGATCCCGCA
TCGCGCCGCA CGCAGACAGC GGCCACGGTG CTGCAGCAGG CGCAGGTCCA CCTCAAGGGC
GCCGCCCTGC CCACGCTGGA CGAAACCTTC GCCGCGCTGG CCACCGCCGC CGCCGGCCGC
TGA
 
Protein sequence
MSSEPTHAPS APPAGPAPQA SGVQRAVLTL LGVVAVAGLA TSVMLWQRLG SIQEQLARQS 
ADAGAQSIEA RTLANQALDM ARDVSARIAV NETRVSEVAL QRSQLEELMQ SLSRSRDENL
VVDIESALRL AQQQAQLTGS LEPVMAALKS ASQRIERAAQ PRLAPVARAI GRDLDRVGSA
QVTDTAGLLA RLDDLMRQVD ELPLQNAVAQ AAATRRMNAA ARPSEGPAAP GADGALPWWQ
AALQRGWEVV RDEARQLLRV TRIDRPEAIL IAPDQAFFLR ENLKLQLMNA RLALLARQYE
SARADLSAAN NALGRYFDPA SRRTQTAATV LQQAQVHLKG AALPTLDETF AALATAAAGR