Gene Avin_17100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_17100 
Symbol 
ID7760645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1693872 
End bp1695221 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content71% 
IMG OID643804609 
Producthypothetical protein 
Protein accessionYP_002798898 
Protein GI226943825 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.550956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACGA TCCATATCGG TTGTGGGGCG GGCTTTGCCA ACGATCGCCC CGATGCGGGA 
CTGCGCCTGG CGCAGGATCT GGCCCGACGT TCCGGCCGAC GCTATCTCAT GTACGAACTC
CTGGCCGAAC GCACGCTGGC CGAGGCGCAG TTGCGCAAGC AGGCCGATCC CCGCGCGGGA
TACGCCGCGC GTCTGTTCGA CTTCCTCCAG CCGGTGCTGG ACACCTGCAT CGAGGCGGGT
ATCCCGATCG TCACCAATGG CGGCGCGGCC AACCCGCGTG CGGCGGCCGA GCGGCTGCGG
GCCGAACTGG GCGGGCGCCA TGCCGGCCTG CGTATCGCCT GCGTGCTGGG CGACGATCTG
ATGGGGATGG ACCGCCGGCG CCTCGGCCAG TGGCTCGACC TCGGCGACCC GCGGGACGAG
CTGGTTTCGG CGAACGTCTA CAGCGGCGCC GACGGCATCG TCCGGGCCCT GGACGAGGGC
GCCGCCATCG TGCTCTGCGG ACGGGTCGCC GACCCGTCCC TGGCCGTCGG TCCGATCCGC
CACGCCCTGG GCTGGGCCGC CGACGACTGG GAGCGGATGG CCATCGCCAC CGTGGCCGGA
CACCTGCTGG AATGCTGCAC CCAGGCCACC GGCGGCTATT TCGCCCATCC CGGTCTCAAG
GAGGTGCCCG ATCCGGCCAA TCTCGGCTGT CCGATCGCCG AGGTCGCCGC GGACGGTCGC
CTGGTGATCA CCAAGACCGC CGGTTCCGGC GGTTGCGTCA GCGAGCGCAC GGTCAAGGAG
CAACTGCTCT ACGAGGTGCA TGATCCGCGC CGCTATCTCA CCCCCGACGT GGTCCTCGAC
CTCGGCGCGG CACGGGTGGA GGCCATCGGC GCCGATCGCG TCGCGGTCGG CGGCATCCAC
GGCCATCCGC GCCCCGATAC GCTCAAGGGG CTGGCCGGCG TGCGCGGGCT CTGGTTCGGC
GAGGCGGAAA TCTCCTACGC CGGTGCCGGC GCCGTGGCCC GGGCACGGCT CGCCCGGGAG
ATCCTGCTGC AGCGCTTCGA CCTGCTGGCG CCGGGCGTGC AGCCCTGGAT CGATCTGGCC
GGCGTCGCCA GCCTGTTCAA CGATGCGCGC GGCGACTATC TCGCCCGGCG CCTGGACCTG
GCGCCCGAGG TGGACGACGT GCGCGTCCGG GTCGGCCTGG TGCATCGCGA CCGCGCTCTG
ATCGAGACGC TGCTGGCCGA GGTGGAGTCG CTCTACACCA ACGGTCCGGC CGGTGGCGGC
GGGGTGCGCC GGCATATCGG CGAATCCATC GCCACCCGCG ACTTCCTGAT TCCCCGCGAG
GCAATCGAAA CACGTCTGGA GTGGTACTGA
 
Protein sequence
MTTIHIGCGA GFANDRPDAG LRLAQDLARR SGRRYLMYEL LAERTLAEAQ LRKQADPRAG 
YAARLFDFLQ PVLDTCIEAG IPIVTNGGAA NPRAAAERLR AELGGRHAGL RIACVLGDDL
MGMDRRRLGQ WLDLGDPRDE LVSANVYSGA DGIVRALDEG AAIVLCGRVA DPSLAVGPIR
HALGWAADDW ERMAIATVAG HLLECCTQAT GGYFAHPGLK EVPDPANLGC PIAEVAADGR
LVITKTAGSG GCVSERTVKE QLLYEVHDPR RYLTPDVVLD LGAARVEAIG ADRVAVGGIH
GHPRPDTLKG LAGVRGLWFG EAEISYAGAG AVARARLARE ILLQRFDLLA PGVQPWIDLA
GVASLFNDAR GDYLARRLDL APEVDDVRVR VGLVHRDRAL IETLLAEVES LYTNGPAGGG
GVRRHIGESI ATRDFLIPRE AIETRLEWY