Gene Avin_35020 details


Gene Information

Locus tag: Avin_35020
Symbol:
ID: 7762397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3575956
End bp: 3577254
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 68%
IMG OID: 643806368
Product: hemolysin-like protein
Protein accession: YP_002800626
Protein GI: 226945553
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
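
The start, end, and length fields above can be checked against each other with a short calculation. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the record states neither convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 3575956
end_bp = 3577254
gene_length_bp = 1299
protein_length_aa = 432

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 1299 0 432
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and gives the listed protein length.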


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.518958
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGGCGG GCCTGCTCCT GCTGCTGATC GCGCTCAACG GCGTGTTCGC GATGGCGGAG 
ATCGCCGTGG TTTCCTCGCG CAAGGCCCGC CTGCAGAGGC TGGCCGACGA GCGGCGGCAC
GGCGCGCTGG TCGCCCTGGC GCTACAAGAT GATCCTTCGC GCTTTCTATC CACCATCCAG
GTCGGCATCA CTACCGTGGG CGTTCTCAGC GGCGCACTCG GCGAAACCCT GCTGGCCCGG
CCGCTGACGC AGCGGATCGC CGAACTGCCG GCGCTAGCGC CCTATGCCGA GGCCATCGCC
CTGATCCTCA CCGTGGCGGC GATCACCTAT CTGTCCGTGG TGGTCGGCGA ACTGCTGCCC
AAACGCCTGG CCCTGCTCGC GCCGGAAACC ATCGCCAGCG CGCTCGCACC GGCGATGCGA
CGCCTGGCGC AGATTGCCGC GCCGCTGGTA TGGCTGCTGT CGTATTCCTG CAATCTGTTG
CTGCGCCTGA TCGGCATCGG ACGTCGCGAC GAACCGCCGA TCACCGACGA GGAGATCCAG
GTGCTGATGG AGCAGGGGGC CGAGGCCGGG GTTTTCCACG AGAGCGAGCA AGCGTTCGTG
GCCAATGTCC TGCACCTGGA CGAGCAGCCG GTCGGCGCGA TCATGTCGCC CCGCCAGCAG
GTCTACGCCA TCGATCTGGA CGATCCGCGG GAGGAGCAGC TCAGGCGGCT CGCCGAGAGC
CCCTATACCC GTGTCCTGGT ATGCCGGGGC GGCCTGCATC GGCTGGTCGG TGTGCTGCAT
CGCGGCGATC TGCTCAAGCC GGCCCTGCAA GGGCAGCCGA TCGATCTGCA GCGGAGCGCC
CGGCCGCCGT TCTATGTGGA GGAAAGCGCG AGCAGTACCG GCCTGCTGGA GGATTTCCGC
AGGACGCGCA ACGAGTTCGC GGTGGTCGTC GACGAATTCG ACGACCTGCA AGGCATCGTC
ACCCTCAGGG ACGTGCTCAC CGCGATCGTC GGCGAGATTC CCGATGCACT GCACGACGGC
GAGCCGGCGA TCGTGCGCCG CGAGGACGGT TCCTGGCTGG TCGACGGTGG CATGGGTATC
GAGCAGTTGA AAACCGCGCT GGACATCGAC GCGGCGTTTC CGGGCGAGGC GGACAACGCC
TATCGCACGC TCGCCGGTCT GGTCATGCAT TGCCTGCAGC GCGTGCCGAG CGTTTCCGAT
CACTTCGAAC TCGACGGCTG GCGCTTCGAG GTGGTCGACA TGGACCGGAC GCGCATCGAC
AAGGTGCTGG TGATGCGCCC GAGCGCGCCG CCGGCCTGA
 
Protein sequence
MEAGLLLLLI ALNGVFAMAE IAVVSSRKAR LQRLADERRH GALVALALQD DPSRFLSTIQ 
VGITTVGVLS GALGETLLAR PLTQRIAELP ALAPYAEAIA LILTVAAITY LSVVVGELLP
KRLALLAPET IASALAPAMR RLAQIAAPLV WLLSYSCNLL LRLIGIGRRD EPPITDEEIQ
VLMEQGAEAG VFHESEQAFV ANVLHLDEQP VGAIMSPRQQ VYAIDLDDPR EEQLRRLAES
PYTRVLVCRG GLHRLVGVLH RGDLLKPALQ GQPIDLQRSA RPPFYVEESA SSTGLLEDFR
RTRNEFAVVV DEFDDLQGIV TLRDVLTAIV GEIPDALHDG EPAIVRREDG SWLVDGGMGI
EQLKTALDID AAFPGEADNA YRTLAGLVMH CLQRVPSVSD HFELDGWRFE VVDMDRTRID
KVLVMRPSAP PA
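
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not part of the record, of how the gene sequence could be stripped of whitespace, its GC fraction computed, and its translation compared with the listed protein. The function names are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), so a standard codon table is built here.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()

def gc_fraction(dna):
    """Fraction of G and C bases; expects a non-empty cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)

def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 15 bases of the gene sequence, copied from the record.
first_bases = clean_sequence("ATGGAGGCGG GCCTG")
print(translate(first_bases))  # prints: MEAGL
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` and `translate` would allow the listed GC content and protein sequence to be compared against values computed from the sequence itself.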