Gene Avin_18820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_18820 
Symbol 
ID7760816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1866510 
End bp1867448 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content72% 
IMG OID643804780 
Producthypothetical protein 
Protein accessionYP_002799069 
Protein GI226943996 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000705752 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACACGC TGGCGGTGGC GGACGGCATC CGGGTGGGTC TGGGCGAGTT GATCGAGATC 
CGCCACCGGG CGCGCGAGCT GCAACTGTTC TCCAGCGCGG CACGCCGCAG CGCGCTGCTC
GGCGCGCACC ACTCGCGGCT GCGCGGGCGC GGCGTGGACT TCGACCAGGT GCGCGTCTAC
CAGGCCGGCG ACGACGTGCG CAACATCGAC TGGCGGGTCA CCGCGCGCAC CCTGGAACCG
CACACCAAGC TGTTCCACGA AGAGCGCGAG CGGCCGATCT TCATCCTCGC CGAGCAGAGC
CGGCAACTGT TCTTCGGCTC CTCGCGGCTG TTCAAGTCGG TACTCGCCGC CCAGGCCGCG
GCGCTGATCG GCTGGGCCGC CCTGGAACAC AACGACCGGG TCGGCGGGCT GGTGTTCGGC
AGCAGCGCGC CCCACGAGAT CAAGCCGCGG CGCAGCAAGC AGAGCCTGCT GCAACTGCTC
AACCGCCTGG TGCGCGCCAA CCACGCCCTG CACGGCGAAC TGCCGGACGA GCCGGACAGC
TTCGGCCAGG CCCTGCGCCG CACCCGCGAG GTGCTGCGCC CGGGCAGCCT GGTGGTGGTG
CTGTGCGACG AGCGGGCGCT GTCGGATACC GCCGAGCGCC AGTTGCTGCT GCTCGGCCGG
CACAGCGAAC TGCTGCTGCT GCCGCTCTCC GACCCCCTCG ATCACGCCCT GCCCGCCGCC
GGCCTGCTGC GCTTTTCCCA GGACGGCGCG CAACTGGAGC TGGACACCCA CGACAGCGGG
CTGCGCCAGG CCTACCGAAG CCTCGGCCAG GCGCGCCAGG CGCGCTGGGA ACGCCTCGCC
GAACGCCTCG GCAGTCTGCT GCTGCCGCTC AGCACGCAGT TCGAACTGAT CGAGCAGTTG
CGCGAGCGCC TGCAGCCGCG ACCGGTATGG CGGCCATGA
 
Protein sequence
MHTLAVADGI RVGLGELIEI RHRARELQLF SSAARRSALL GAHHSRLRGR GVDFDQVRVY 
QAGDDVRNID WRVTARTLEP HTKLFHEERE RPIFILAEQS RQLFFGSSRL FKSVLAAQAA
ALIGWAALEH NDRVGGLVFG SSAPHEIKPR RSKQSLLQLL NRLVRANHAL HGELPDEPDS
FGQALRRTRE VLRPGSLVVV LCDERALSDT AERQLLLLGR HSELLLLPLS DPLDHALPAA
GLLRFSQDGA QLELDTHDSG LRQAYRSLGQ ARQARWERLA ERLGSLLLPL STQFELIEQL
RERLQPRPVW RP