Gene Avi_5420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5420 
Symbol 
ID7381519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp421282 
End bp422394 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content60% 
IMG OID643649027 
Productlipoprotein 
Protein accessionYP_002547264 
Protein GI222106473 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.406883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGACC TGCGCAGCCT TTCCCGTCGC GGTTTTCTCA ACATGGCGGC AGCCGGCAGC 
GCCTCACTGG CGCTTCCCGG CATGGTGCGC TCTGCCTCAG CCGCCAATAC CCTGGCGGCC
ATAAAGGAAG AGGACGCCGT GATTGGCTTC GGCCATGTCG GCCCGGTGAC GGATGAAGGC
TGGACCTGGT CGCACCATCA GGGTGTTCTC GCCGTCAAGG AAAAATTCCC CAAGCTGAAG
AAGATCCTCG AGGTCGAGAA CGTTCCCTAT TCAGCGGATG CGACCCGCAC CTATCGGCAA
TTCGTGTCGG AAGGCGCGAA CATGATTTTC GATACGTCGT CTACCGGCGA CTTCCTGCAT
GACGTGGTGC GCCGCGCCAA AGACACCGCC TTCATGGAGT GCAATGGCCA TGTGACGATG
GACAATCTCG GCTGGTATTA TATGGCCCAT TGGTATCCAA CCTATGTGGT CGGCGTCGCC
GCAGGGCATC TGTCGAAAAC CGGCAAACTC GGTTACGTCG CCTCCTTCCC GGTTGCTTCG
GTCTATGCCT CGACCAACGC CTTCCTGATG GGCGCGCGCA CCGTCAACCC CAATGCCACC
TGCCAGACCA TCACCATCAA TTCCTGGTTC GATCCGCAGG CCGCCGCCCA GGCTGGCACC
GCGCTGATCG ACAATGGCTG CGATTTCCTG TTCGGCATCA TGGATGAGGC CGCCTATCTT
CAGGTCGCCG AAAAACGCGG CGTCTGGGCT GCGATGTGGA ACACCGACAT CCGCCGCTAT
GGCCCGAATT CCTACGTGTC TTCGATCATT ATCGACTTCA AGGAGTTCTA TATCGATCAG
GTCCGCAAGC GGCTGGCAGG CGAATGGTCG CCTTCGGAAA GCATCTTCGC CATGGGCGCA
GGCGTTGACC GCGATAGCTG GGGCGCCAAG GTTCCCGCCG AAGTCGGCAA GGCGGCAGAC
GATATACGCA CGAAAATCCT GGGCGGCTGG TCGCCGTTTG TCGGCGAATT GAAGGACGCC
AAGGGCGCTG TGCGGGTGGC CAAGGGCCAG AAGATGACCG AACTCGAGCT TTATAATTGG
GATTGGTCAG TGGAAGGCGT CACGGGGCTT TAA
 
Protein sequence
MIDLRSLSRR GFLNMAAAGS ASLALPGMVR SASAANTLAA IKEEDAVIGF GHVGPVTDEG 
WTWSHHQGVL AVKEKFPKLK KILEVENVPY SADATRTYRQ FVSEGANMIF DTSSTGDFLH
DVVRRAKDTA FMECNGHVTM DNLGWYYMAH WYPTYVVGVA AGHLSKTGKL GYVASFPVAS
VYASTNAFLM GARTVNPNAT CQTITINSWF DPQAAAQAGT ALIDNGCDFL FGIMDEAAYL
QVAEKRGVWA AMWNTDIRRY GPNSYVSSII IDFKEFYIDQ VRKRLAGEWS PSESIFAMGA
GVDRDSWGAK VPAEVGKAAD DIRTKILGGW SPFVGELKDA KGAVRVAKGQ KMTELELYNW
DWSVEGVTGL