Gene Avi_3722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_3722 
SymbolkpsF 
ID7388192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp3085833 
End bp3086828 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content59% 
IMG OID643652507 
Productcapsule expression protein 
Protein accessionYP_002550689 
Protein GI222149732 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGC TGGCTGTAAA ACTCGTGGAA GCAAGCGCCA TCAAGGCTGC ATTGCGTGTC 
GTCGCTACCG AACAAAGCGG GCTGGCGGCC CTGGAAGAGG CTTTGGCAGG GTATCTTGCC
GGGCCGTTTT GCAACGCAAT CGATGTTATC GGCAAAAGCT CGGGCCGGGT GATCGTCTCT
GGCGTCGGCA AGAGCGGCCA CATCGGCGGC AAGATCGCGG CAACGTTCGC CTCGACAGGC
ACACCGGCTT TCTTCATCCA TCCGGCGGAA GCCAATCATG GCGACCTCGG CATGATTGCC
CGTGACGATG TGGTGATTGC CCTGTCCTGG GGCGGCGAAA GCACCGAGCT GAACGGAATC
CTGTCCTTCA CTCGCCGGTT TTCCATTCCA CTGATAGCCA TTACCGCCGG TGAGCAATCC
ACGTTGGCAC GTGAAGCCGA TATCGTGCTT TTGATGCCCA AGGTGCAGGA AGCCTGCCCG
CATGGCTTGG CGCCGACCAC CTCCACCATG ATGCAGATGG CGCTGGGCGA TGCGCTGGCG
CTGGCGCTGC TGGAAGCTCG TGGCTTCGGG CCGAATGATT TCAAGACCTT CCATCCGGGC
GGCAAGCTGG GCGCGATGCT GACTCATGTC GGCGACATGA TGCATATCGG CGAGGACGTG
CCGCTGGTGC CGGAAGGCAC ATCGGTGCCG GAAGCGATTA TCATGCTATC GCAGAAGCGT
TTTGGCTGCG TCGGCGTCAC CGACAGCGCT AACCGGTTGG TCGGTATCAT CACCGATGGC
GATATTGCCC GTAACCTCAA CCGCAATCTC GGTGAGCGGA TGGTGGAAGA GGTGATGACC
CGTCACCCAA AGACGGTTCA CACCGAAACA CTTGCGACCA CCGCCATGGC GATCCTCAAC
CAGCACAATA TTTCGGCGCT GTTTGTGACC GATGAAGACG GTGTGCCAAA CGGCATCATC
CACTTCCATG ATTTGTTGCG GATTGGTGTG GCTTAA
 
Protein sequence
MNKLAVKLVE ASAIKAALRV VATEQSGLAA LEEALAGYLA GPFCNAIDVI GKSSGRVIVS 
GVGKSGHIGG KIAATFASTG TPAFFIHPAE ANHGDLGMIA RDDVVIALSW GGESTELNGI
LSFTRRFSIP LIAITAGEQS TLAREADIVL LMPKVQEACP HGLAPTTSTM MQMALGDALA
LALLEARGFG PNDFKTFHPG GKLGAMLTHV GDMMHIGEDV PLVPEGTSVP EAIIMLSQKR
FGCVGVTDSA NRLVGIITDG DIARNLNRNL GERMVEEVMT RHPKTVHTET LATTAMAILN
QHNISALFVT DEDGVPNGII HFHDLLRIGV A