Gene Avin_52100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_52100 
Symbol 
ID7764047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5322077 
End bp5323537 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content70% 
IMG OID643808026 
Producthypothetical protein 
Protein accessionYP_002802260 
Protein GI226947187 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGGTG CATTGCTTCG TACCGGAACG CTGCGGGAAT TCAGGGCGCT CGGCGTCGAT 
GGCCAGCCGG TGCACGGCGT GGCGCTGCAG TTGCGCGAGG CCATCCGCCT GAAGATGCAG
CGCGAGGCGG CCGATTGCCT GGCCATCCCG CAATCCAACG AGGCGGGGGA CCGCATCGAC
TGGTACGCGC CGTTCGAGGG CGACGTAGTG CCCTGGTCGG CGGCCACCGC GGAAGAGCGC
ATCCAGGCCC GTGCCCGGCT CGAGGCGATG CAGGCCCGGT TGCGGGCCAC CGGCGAGAAC
ATGCGCGACG ACGTGCAGAA CCGGGAGAAG CAGGTCTTCG GCCGTCTCTT GGAAAAGGCC
CTGTATTTTC CCGATGCCGA CCATGTCTAT CTGGTCGACG GCCGGCCGGT GGTCACCTTC
TGGGGCTTCA CCCGGCAGCA GGACGGCCAG TCGCCCGATC CTCTCGCCTG CCTGCAGGTC
GCCAGGCCCG CCCCGGCCCC CGTGGCGGAC ACCGTCTTGC CGCCGCCCGT GACGCCTGCG
GCGGCAGCCG CGGCCGCGGT TGCGGAAAAA CCCCGCTGGC GGCGCTGGCT GTGGCTGCTG
CTCCTGCCGC TGCTCCTGCT GTTGCTGCTG TTCCTCATGC GCGCCTGCGC GCCGACCGTC
GAACTGCCCT TCGATCTGTC GCATGTCGAC CTGCCCGGCC TGCCGGCCAG GGAAAGGGTC
GCGGAAGAGG TCCGGCTGCG CGAGGAGGTG GTGGGCGTGA CGGGCGCGGC CGGTGTCGTC
GGAACCGAGG GAGAAGGTAG CGTGCCGGTA CCGGACGGCG AAATGACTGT CGAGGAAGTG
CCGCTCGAAG AGGGATCGGC GAGCGAATCC GAAGCGGGCG AAGCCGCGGC GGTTGACCCC
GCGGCCGAAG AGGCGACGCA GGACCGGCAA CCGTCCGCCG GGGACGGAGA GAAGGAGCCG
GAGGCGACGC CCGAAGACGC ACAACAGAAG CCGCCGGTTC CGCCGCAACT CAACGAGGAA
AAGCCGGCGC AAGACCCGAA GGCCGCGCAG GAGCAGGAAA AAGGAGCCGG GGAACAGCAA
GGCGCCAAGC CCATGAGCAT TCCGCCCGAG GCGCTGAAGA GCGGTTCGAC CCGTTTCCTC
GACGGCAACT GGCGGGCCGG CGCCGGCATC CAGGACGCCA AGACCGGCAA GCCGCTGCAG
CTGGGTTACG ACTTCAAGGA CGGCAAGGGC CAGGTCAGCA TCCGCCGTGA CGACGGTGTG
CGCTGCGCGG GCCCGGTGAA CGCGACCGTG CAGGGCGGCA GCCTGGCGAT CGCCAGCCAG
GGCCAGGCGA CCTGCAGCGA CGGCAGCCAC TACCGCATGC CGGAAGTGAC CTGCAAGCCG
GATGCGCGCA GCGCGGCCGA CTGTACCGGC CGCTACGGCG ACCAGGAATT CCCCATGTCG
ATCCGCCAGG GCGGCAACTG A
 
Protein sequence
MPGALLRTGT LREFRALGVD GQPVHGVALQ LREAIRLKMQ REAADCLAIP QSNEAGDRID 
WYAPFEGDVV PWSAATAEER IQARARLEAM QARLRATGEN MRDDVQNREK QVFGRLLEKA
LYFPDADHVY LVDGRPVVTF WGFTRQQDGQ SPDPLACLQV ARPAPAPVAD TVLPPPVTPA
AAAAAAVAEK PRWRRWLWLL LLPLLLLLLL FLMRACAPTV ELPFDLSHVD LPGLPARERV
AEEVRLREEV VGVTGAAGVV GTEGEGSVPV PDGEMTVEEV PLEEGSASES EAGEAAAVDP
AAEEATQDRQ PSAGDGEKEP EATPEDAQQK PPVPPQLNEE KPAQDPKAAQ EQEKGAGEQQ
GAKPMSIPPE ALKSGSTRFL DGNWRAGAGI QDAKTGKPLQ LGYDFKDGKG QVSIRRDDGV
RCAGPVNATV QGGSLAIASQ GQATCSDGSH YRMPEVTCKP DARSAADCTG RYGDQEFPMS
IRQGGN