Gene Avin_42040 details

Gene Information

Locus tag: Avin_42040
Symbol:
ID: 7763082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4232559
End bp: 4233719
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 71%
IMG OID: 643807057
Product: hypothetical protein
Protein accession: YP_002801306
Protein GI: 226946233
COG category: [S] Function unknown
COG ID: [COG3177] Uncharacterized conserved protein
TIGRFAM ID:
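
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4232559
end_bp = 4233719
gene_length_bp = 1161
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and encodes no amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_length = codons - 1

print(span_bp, remainder, expected_protein_length)  # prints: 1161 0 386
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.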


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGACT CACGCTGGAT CTGGCAGCAA CCGGATTGGC CGCATTTCCG CTGGCAGGCC 
GAGCGCCTGG CGCCGCTGCT GCGCGACTGC GCGCAGGCCC AGGGCCGACT GCTCGGTATG
GCCGGCGCGG CAGGGGACGA ACTCGGCGCG CAGAGCGAGC TGGACGCCCT GCTGCAGAAC
ATCGTCACCT CCTCGGCCAT CGAAGGCGAG CGGCTCGACG TCGGCTCGGT GCGCTCCTCG
CTGGCCCGCC GCCTGGGCCT CGAAACGGAC GGCGCCAGCC GCGTCAGCCC GCGCAGCGAA
GGTCTCGCCG AGCTGATGCT GGACGCCACC CGGCACCTGG AGCAACCGCT GAGCCTCGAA
CACCTGCTGC ACTGGCACCG CCTGCTGTTT CCCGCGCAGG ACGACGACCT GCTGCCACGG
CGCATCCGCG TCGGCGCCCT GCGCGGCGAA GAGCCTATGC AGGTGGTCTC CGGCCGGCTC
GACCGGCTCA CCGTGCATTT CGAGGCGCCG CCGCGCGACG GGCTGGAACG GCAACTGGCC
GCCTTCCTCG ACTGGTTCGA GGCGAGCCGC CGCGACGGCG GCCTCGACCC GCTGCTGCGC
GCCGGCATCG CCCATTTCTG GTTCGTCACC CTGCATCCCT TCGACGACGG CAACGGCCGG
CTGACCCGCG CCATCACCGA CCTGGCGCTG GCCCAGGGCG AGCGCCAGGC GATCCGCCTG
CATGCCATGT CGGCCAGTAT CCTCGACGAC CGCCAGGGCT ACTACCGCAT CCTCGAAGCC
AACCAGAAGG GTGGCCCGGA GATCACCCCC TGGCTGGAAT GGTTTCTCCA TACCCTGCTG
CGCAGCCTGC AACAGGCGCT GGCGCGCATC GACCGGGTGC TGGCCAAGTC GCGCTTCTGG
CAGCGGCACC GCCATCAGGC GCTGTCCGCC GAACAGATCA AGGTGCTCGA TCGCCTGCTC
GACGGCGGCG AACGCGGCTT CGAAGGCGGC ATCAGCGCCG CCCAGTACCA GGCCGTCGCC
AGGGTTTCCA AGGCCACCGC CACCCGCCAC CTCGGTGACC TGCTCGACAA GGGCTGGCTC
GTCCGCCTGC CCGGCGGCGG GCGCAGCACC CGTTACCGGA TCGACTGGCC CACCACCGGC
GGGCCCGATA CATCCCTGTA G
 
Protein sequence
MEDSRWIWQQ PDWPHFRWQA ERLAPLLRDC AQAQGRLLGM AGAAGDELGA QSELDALLQN 
IVTSSAIEGE RLDVGSVRSS LARRLGLETD GASRVSPRSE GLAELMLDAT RHLEQPLSLE
HLLHWHRLLF PAQDDDLLPR RIRVGALRGE EPMQVVSGRL DRLTVHFEAP PRDGLERQLA
AFLDWFEASR RDGGLDPLLR AGIAHFWFVT LHPFDDGNGR LTRAITDLAL AQGERQAIRL
HAMSASILDD RQGYYRILEA NQKGGPEITP WLEWFLHTLL RSLQQALARI DRVLAKSRFW
QRHRHQALSA EQIKVLDRLL DGGERGFEGG ISAAQYQAVA RVSKATATRH LGDLLDKGWL
VRLPGGGRST RYRIDWPTTG GPDTSL
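
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline that produced the record. It assumes the listed gene sequence is already in coding orientation and starts in frame. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 for internal codons; table 11's alternative start codons are not handled and are not needed here because the gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; these amino acid assignments are the
# standard ones, which translation table 11 also uses for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
first_block = translate_cds("ATGGAAGACT CACGCTGGAT CTGGCAGCAA")
last_block = translate_cds("GGGCCCGATA CATCCCTGTA G")
print(first_block, last_block)  # prints: MEDSRWIWQQ GPDTSL
```

The two translated fragments agree with the first ten and last six residues of the listed protein sequence.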