Gene Avin_18800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_18800 
Symbol 
ID7760814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1865013 
End bp1866020 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content74% 
IMG OID643804778 
Productvon Willebrand factor, type A (VWA) domain protein 
Protein accessionYP_002799067 
Protein GI226943994 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00117552 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGAAT TCGCCTGGCC CTGGGTCTTC CTGCTCGCCC CGCTGCCCTG GCTGCTGCGC 
CTCGTGCTGC CGCCGGCCGA CAGCGGCGAG ACGGCGCTGC GGGTGAGCTT CCTCGGCGAG
CTGGAAAGCC TCAGCGGCCG CCGCGCCCGC CTGCGCCTGC CGGGCTGGCG GCAACAGGCG
CCGTTCGTCC TGCTCTGGCT GCTGCTGCTC GGCGCCGCCG CGCGCCCCGA ATGGGTCGGC
GAACCCCGGC CGCTGCCCGC CAGCGGCCGC GATCTGCTGC TGGCGGTAGA CGTTTCCGGC
TCCATGGAAT ACGCCGACAT GCACTGGCAG GGCGAGAGCA TCGGCCGCCT GGAACTGGTC
AAGCACCTGC TCGGCCAATT CATCGAGGAC CGCCGGGGCG ACCGCGTCGG GCTGATCCTG
TTCGGCAGCC AAGCCTACCT GCAGGCGCCG CTGACCTTCG ATCGCCGGAC CGTGCGCACC
TGGCTGGAGG AAGCCGCGAT CGGCATCGCC GGCAAGGACA CCGCCATCGG CGACGCCATC
GGCCTGGGCC TCAAGCGCCT GCGCCAGCGT CCGGCGCAGA GCCGCGTGCT GATCCTGGTC
ACCGACGGCG CCAACACCGC CGGCGAGATC GCTCCGTCGG TCGCCGCCCG CCTGGCCGCC
GCGGAAGGGG TACGCATCCA TACCATCGGC ATCGGCGCCG ATCCCCGGCA GGACGGACCG
CCCGGCCTGC TCGGCCTGAC GCCGGGACTG GATCTCGACG AGCCGACCCT GCGCGCCATC
GCCGAAGAGA CCGGCGGCAG CTACTTCCGC GCCCGCAGCA GCGAGGAACT GCGCGCCATC
GAGGAAACCC TCGCGCGCCT GGAGCCGGTC GCCCAGCCGC CGACCCAGGC GCGCCCGGCC
CGCCCGCTGT ATCCCTGGCC GCTGGCCACG GCGCTATTGC TCGGCCTGCT GCTGGTGGCC
CGCAGCCTCT GGCCGGCGCG CGCGCGCTCG CGAGGAACGC GCCGATGA
 
Protein sequence
MFEFAWPWVF LLAPLPWLLR LVLPPADSGE TALRVSFLGE LESLSGRRAR LRLPGWRQQA 
PFVLLWLLLL GAAARPEWVG EPRPLPASGR DLLLAVDVSG SMEYADMHWQ GESIGRLELV
KHLLGQFIED RRGDRVGLIL FGSQAYLQAP LTFDRRTVRT WLEEAAIGIA GKDTAIGDAI
GLGLKRLRQR PAQSRVLILV TDGANTAGEI APSVAARLAA AEGVRIHTIG IGADPRQDGP
PGLLGLTPGL DLDEPTLRAI AEETGGSYFR ARSSEELRAI EETLARLEPV AQPPTQARPA
RPLYPWPLAT ALLLGLLLVA RSLWPARARS RGTRR