Gene Avi_1667 details


Gene Information

Locus tag: Avi_1667
Symbol:
ID: 7386679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 1394574
End bp: 1395788
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 58%
IMG OID: 643650996
Product: hypothetical protein
Protein accession: YP_002549200
Protein GI: 222148243
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
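
The length fields in this record agree with the coordinates if the start and end positions are read as 1-based and inclusive, and if the last codon of the gene is a stop codon that is not counted in the protein. A minimal sketch of that arithmetic follows; the function names are illustrative and not part of the record, and the inclusive-coordinate convention is an assumption.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1394574, 1395788)
print(length_bp)                  # 1215
print(protein_length(length_bp))  # 404
```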


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGATC CGAAACGGCT GACTGAAACG GCCAGCGGTC AAACTTTTGT GGCAGATATC 
GTCGTGGACG TTCTGCCCGT GATGGAGCCG CTGGAGGCCG ACTGGCGCTG TCTGGAGCGC
AACAACCATC TGTCGCTGCA TCAAGGTTAC GACTGGTGCC GCGCCTGGGT GAAAACCCAT
GGCAATCCGC TGGCCATTCT GCATGGCAAC AGCAACGGAC GCAGCCTGTT CATCCTACCG
CTGGAAATCA CCCGTCACGC CATGGTCCGC AAGGCCAGTT TCATCGCCAC CCGCTTCACC
AATATCAATA CCGGCCTGTT CGACCCGGCT TTTCTCCAGC AAATCGAGTC GGACACAGCC
AAACAGTTGG GAAGGCAGAT TGTCCAGGCA ATGACCGGGC ATGCCGATCT CGTTCATCTC
GGCAATAGTC CGCTCTCCTG GCGTGGATTT ACTCATCCGC TGGTCGGCCT ACCGGCTGTC
GAACATCAGA ACCATGCCTT CCAACTGCCG CTTCTAGGCG ACTTCGAACA GACGCTCTCC
CAGATCAACG CCAAGCGGCG GCGCAAGAAA TACCGCAATC AGGTCCGCAA GCTGGAAGCA
AGCGGCGGCT TTGAGCATAT TATTGCGTGC GGTGAGGAGC AGAAAGCCTG GCTGCTGGAT
CTGTTCTTCC GGCAAAAAGC CGTCCGCTTC GAAACCCTCG GTCTGCCGGA CGTGTTTCAG
GAGCCGGAGA CACAGGCATT CTTCCAGTTG CTGCTGCAAA GCGAGGCAGG TGGATTGAAC
GTGCCACTGG AACTGCATGC ACTGCGGTTG TCGGGCAGCC ATCATAACGG CAAGATTGCC
GCCATCGCCG GTCTTTCACG CAAGGGCGAT CACGTTATCT GCCAGTTCGG CTCGATAGAT
GAAAGCATCG CCCCGGAGAC CAGCCCCGGC GAATTGCTGT TCTGGCTGAT GATTGAACAA
TGCTGCGCCG AGGGGGCCGC CCTGTTCGAC TTCGGCCTCG GCGACCAGAT CTACAAGCGA
AGCTGGTGCC CTATGGAAAC CGTGCAACAC GATATTTTGC TGCCCGTGAC CCCTCTGGGG
CACCTCGCAG CCACAGCCGA GCGTAGCCTG ACCCGCTCCA AGGCGTTCAT CAAGGGGCAT
CCCCAACTCT ATAGCGCGCT GCAAAAACTC CGCGCTCGTA GCGATGCGCA AGCACATGAT
CAGGGTAAGG AATGA
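
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence above. The sketch below assumes that the spaces between 10-base blocks and the line breaks are display formatting only; `clean_sequence` and `gc_content` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and return the sequence in upper case."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Fraction of G and C bases in the cleaned sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Usage: paste the full gene sequence, blocks and line breaks included.
first_blocks = "ATGATCGATC CGAAACGGCT"
print(f"{gc_content(first_blocks):.0%}")  # GC fraction of these two blocks only
```

Applied to the complete sequence, the result would be expected to round to the reported percentage, although the rounding rule used by the database is not stated in the record.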
 
Protein sequence
MIDPKRLTET ASGQTFVADI VVDVLPVMEP LEADWRCLER NNHLSLHQGY DWCRAWVKTH 
GNPLAILHGN SNGRSLFILP LEITRHAMVR KASFIATRFT NINTGLFDPA FLQQIESDTA
KQLGRQIVQA MTGHADLVHL GNSPLSWRGF THPLVGLPAV EHQNHAFQLP LLGDFEQTLS
QINAKRRRKK YRNQVRKLEA SGGFEHIIAC GEEQKAWLLD LFFRQKAVRF ETLGLPDVFQ
EPETQAFFQL LLQSEAGGLN VPLELHALRL SGSHHNGKIA AIAGLSRKGD HVICQFGSID
ESIAPETSPG ELLFWLMIEQ CCAEGAALFD FGLGDQIYKR SWCPMETVQH DILLPVTPLG
HLAATAERSL TRSKAFIKGH PQLYSALQKL RARSDAQAHD QGKE
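
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. A minimal sketch of that translation is given below. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene begins with ATG. `translate` is an illustrative name.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGATCGATC CGAAACGGCT"))  # MIDPKR
print(translate("CAGGGTAAGG AATGA"))       # QGKE
```

The two calls translate the first and last blocks of the gene sequence and reproduce the start (MIDPKR) and end (QGKE) of the protein sequence listed above.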