Gene Information |
Locus tag | Avin_18820 |
Symbol | |
ID | 7760816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1866510 |
End bp | 1867448 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643804780 |
Product | hypothetical protein |
Protein accession | YP_002799069 |
Protein GI | 226943996 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
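The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal sanity check, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes a single terminal stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1866510
end_bp = 1867448
reported_gene_length = 939      # bp
reported_protein_length = 312   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.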
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000705752 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCACACGC TGGCGGTGGC GGACGGCATC CGGGTGGGTC TGGGCGAGTT GATCGAGATC CGCCACCGGG CGCGCGAGCT GCAACTGTTC TCCAGCGCGG CACGCCGCAG CGCGCTGCTC GGCGCGCACC ACTCGCGGCT GCGCGGGCGC GGCGTGGACT TCGACCAGGT GCGCGTCTAC CAGGCCGGCG ACGACGTGCG CAACATCGAC TGGCGGGTCA CCGCGCGCAC CCTGGAACCG CACACCAAGC TGTTCCACGA AGAGCGCGAG CGGCCGATCT TCATCCTCGC CGAGCAGAGC CGGCAACTGT TCTTCGGCTC CTCGCGGCTG TTCAAGTCGG TACTCGCCGC CCAGGCCGCG GCGCTGATCG GCTGGGCCGC CCTGGAACAC AACGACCGGG TCGGCGGGCT GGTGTTCGGC AGCAGCGCGC CCCACGAGAT CAAGCCGCGG CGCAGCAAGC AGAGCCTGCT GCAACTGCTC AACCGCCTGG TGCGCGCCAA CCACGCCCTG CACGGCGAAC TGCCGGACGA GCCGGACAGC TTCGGCCAGG CCCTGCGCCG CACCCGCGAG GTGCTGCGCC CGGGCAGCCT GGTGGTGGTG CTGTGCGACG AGCGGGCGCT GTCGGATACC GCCGAGCGCC AGTTGCTGCT GCTCGGCCGG CACAGCGAAC TGCTGCTGCT GCCGCTCTCC GACCCCCTCG ATCACGCCCT GCCCGCCGCC GGCCTGCTGC GCTTTTCCCA GGACGGCGCG CAACTGGAGC TGGACACCCA CGACAGCGGG CTGCGCCAGG CCTACCGAAG CCTCGGCCAG GCGCGCCAGG CGCGCTGGGA ACGCCTCGCC GAACGCCTCG GCAGTCTGCT GCTGCCGCTC AGCACGCAGT TCGAACTGAT CGAGCAGTTG CGCGAGCGCC TGCAGCCGCG ACCGGTATGG CGGCCATGA
Protein sequence | MHTLAVADGI RVGLGELIEI RHRARELQLF SSAARRSALL GAHHSRLRGR GVDFDQVRVY QAGDDVRNID WRVTARTLEP HTKLFHEERE RPIFILAEQS RQLFFGSSRL FKSVLAAQAA ALIGWAALEH NDRVGGLVFG SSAPHEIKPR RSKQSLLQLL NRLVRANHAL HGELPDEPDS FGQALRRTRE VLRPGSLVVV LCDERALSDT AERQLLLLGR HSELLLLPLS DPLDHALPAA GLLRFSQDGA QLELDTHDSG LRQAYRSLGQ ARQARWERLA ERLGSLLLPL STQFELIEQL RERLQPRPVW RP
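The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be stripped before any analysis. The following sketch shows one way to do that, to compute a GC fraction, and to translate the coding sequence. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for sense codons. The function names are hypothetical, and only short excerpts of the record are used as input.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (first base varies slowest).
# Translation table 11 uses the same sense-codon assignments as the
# standard code; it differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final block of the gene sequence in the record.
first_blocks = "ATGCACACGC TGGCGGTGGC GGACGGCATC"
last_block = "CGGCCATGA"

print(translate(first_blocks))  # MHTLAVADGI
print(translate(last_block))    # RP
```

The two excerpts translate to the first ten and last two residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field.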