Gene Avi_7168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_7168 
Symbol 
ID7380333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011981 
Strand
Start bp135594 
End bp136652 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content57% 
IMG OID643641281 
Producthypothetical protein 
Protein accessionYP_002539578 
Protein GI222102539 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.04303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTTC AACAAAAAAT CGCAACCCTG CCCTTTGTGG ATATTGGCCG CTTCCATGCG 
GGACCAGAGG AGCGCGCAGC CTTCATCGCA GACCTGCGTC GCATTCTGTT TGATCACGGC
TTTTTCTATC TCACCGGCCA TGGCGTTGAT CCAAAGCTGA TTGCGGATGT GCTCGAGACC
GCCAAACGCT TTTTTGCGCT GCCGCTTGAG GAAAAGCTGA AGATTGAAAT GGTGAAATCC
CGGCACTTTC GCGGCTACAA TCGTGCGGGC TATGAGCACA CCCGTGGTCA GCAGGATTGG
CGCGAACAAC TGGATATAAA TACGGAAGGC ACGCCTGTCG AGATTGGCCC GGAAACACCT
GCGTGGAAAC GTTTGCTCGG GCCAAATCAA TGGCCAGAGG CTATTCCAGA ACTAAAGCCC
CTGCTGCTGA CCTATCAGGC AGAAGTCACC TGCGTTGGCA TTGATGTTTT GAAGGCCATT
GCCGTGGCGC TTGACCAGCC GGAAGATGTG TTTGCGCAGA TCTACGAGCC GCAACCATCG
CAACTGTTGA AAATCATTCG CTATCCCGGG CGGGATGTGG CTGAGACAGA TCAGGGCGTT
GGTGCCCACA AGAACGGCGG CTTCGTCACG GTTCTTTTGC AAGACAAGGT CGAAGGTCTA
CGGGTGCAGA CTGAAGACGG CGTGTGGCTG GATGCTCCGC CCGTACCGGG CACCTTCGTG
GTTAACACCG GGGAATTGCT GGAACTGGCC ACCAATGGCT TCGTGCGGGC CGACGTGCAT
GATGTGGTTG CACCGCCTGC CGGTATCGAG CGCTTCTCCG TCGCCTTCTT CTTAGGCTCG
CGCTACGACG CAACGATTCC GGTGATTACG CTTCCAGACG AGCTGCATCG AAAAGAGCGC
GGCATCACGG TTGATCTGCT GAACCCGATC TTTCGGGAAG TTGGCCAGAA CCATCTCAAA
AGCCGCCTGC GGTCGCACCC CGATGTTGCC CGCGCCCACC ACGCTGATTT GCTCACGCCT
GAGCAATTGG CCGGACAGGC GGTAGCGCAG GCCTATTAA
 
Protein sequence
MTLQQKIATL PFVDIGRFHA GPEERAAFIA DLRRILFDHG FFYLTGHGVD PKLIADVLET 
AKRFFALPLE EKLKIEMVKS RHFRGYNRAG YEHTRGQQDW REQLDINTEG TPVEIGPETP
AWKRLLGPNQ WPEAIPELKP LLLTYQAEVT CVGIDVLKAI AVALDQPEDV FAQIYEPQPS
QLLKIIRYPG RDVAETDQGV GAHKNGGFVT VLLQDKVEGL RVQTEDGVWL DAPPVPGTFV
VNTGELLELA TNGFVRADVH DVVAPPAGIE RFSVAFFLGS RYDATIPVIT LPDELHRKER
GITVDLLNPI FREVGQNHLK SRLRSHPDVA RAHHADLLTP EQLAGQAVAQ AY