Gene Avin_50810 details


Gene Information

Locus tag: Avin_50810
Symbol:
ID: 7763929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 5149857
End bp: 5151338
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 66%
IMG OID: 643807909
Product: DUF877 family protein
Protein accession: YP_002802143
Protein GI: 226947070
COG category: [S] Function unknown
COG ID: [COG3517] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03355] type VI secretion protein, EvpB/VC_A0108 family
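
The gene and protein lengths in this record can be cross-checked against the start and end coordinates. The Python sketch below is only an illustration. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names gene_length and protein_length are hypothetical and are not part of any database API.

```python
# Hypothetical helpers for checking the length fields of a gene record.
# Assumption: start/end coordinates are 1-based and inclusive.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(5149857, 5151338)
print(length_bp, protein_length(length_bp))  # 1482 493
```

Under these assumptions the result agrees with the Gene Length (1482 bp) and Protein Length (493 aa) fields.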


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCAGC CGTCCGCCGG CGAGACCCAA GCCAGCGAGA ACATCACCCT GTCCCTGCTC 
GACCGCATCA TCGCCGAGGG CCGCATGGCC CACGACGACA GCCAGCAGGA CTACGCCCGC
GACATGCTCG CGGAGTTCGC CACCCAGGTC CTCGACGAAG GCATGGCCAT CGACAAGGAC
ACCGTGTCGA TGATCAACGA CCGCATCGCC CAGATCGATG CGCTGATCGG CGCCCAGCTC
GACGAAATCC TCCACCATCC CGAGTTGCAG AAGCTGGAAG CCTCCTGGCG CGGCCTGCAC
ATGCTGGTGA AGAACACCGA GACCGGCGCG CGTCTGAAGC TGCGCCTGCT CAACGTGACC
CAGAAGGAAC TGCTGATCGA TCTGGAGAAG GCCGTCGAGT TCGACCAGAG CGCGCTGTTC
AAGAAGATCT ACGAGGAAGA GTACGGCACC TTCGGCGGGC ATCCGTTCAG CCTGCTGGTC
GGCGACTACA GCTTCGGCCG CCATCCGCAG GACATCGGCC TGCTGGAAAA GCTCTCCAAC
GTCGCCGCCG CCGCCCATGC GCCCTTCATC GCCGCGGCCA GCCCGCGCCT GTTCGACATG
GGCAGTTTCA CCGAACTGGC GGTGCCGCGC GATCTGGCGA AGATCTTCGA GAGCCAGGAA
CTGATCAAGT GGCGCGCCTT CCGCGAGAGC GAGGATTCGC GCTACGTCTC CCTGGTGCTG
CCGCACGTCC TCCTGCGCCT GCCCTACGGA CCGGACACCT GCCCGGTGGA AGGCATGGAC
TACGTCGAGG ATGTCAACGG CCGCGACCAC GCCAGGTACC TCTGGGGCAA TGCCGCCTGG
GCCCTGACCC AGCGCATCAC CGAGGCCTTC GCCCGCTATG GCTGGTGCGC GGCGATTCGC
GGCGTGGAAG GCGGCGGCGC GGTCGAGGGC CTGCCGGCGC ACAGCTTCCG CACCAGCTCC
GGCGACCTGT CGCTGAAATG CCCGACCGAG GTGGCGATCA CCGACCGTCG CGAAAAGGAA
CTCGACGCAC TCGGCTTCAT CGCCCTCTGC CACAAGAAGA ACAGCGACCT GGCGGTGTTC
TTCGGCAGCC AGACGACCAA CAGGCCCAGG GTCTACAACA CCAACGAGGC CAACGCCAAC
GCCCGTATCT CGGCGATGCT GCCCTATGTC CTGGCCGCCT CGCGCTTCGC CCACTATCTC
AAGGTGATCA TGCGCGACAA GGTCGGCAGC TTCATGACCC GCGACAACGT GCAGACCTAC
CTGAACAATT GGATCGCCGA CTACGTGCTG ATCAACGACA ACGCCCCGCA GGAGATCAAG
GCGCAGTATC CGCTGCGCGA AGCGCGGGTG GACGTTTCGG AAGTCGTCGG CAAACCGGGC
GTCTATCGCG CCACGGTGTT CCTCCGCCCG CACTTCCAAC TGGAAGAGCT GACCGCCTCG
ATCCGCCTGG TCGCCACGCT GCCGCCGCCG GCCGCGGCCT GA
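
The GC content field can in principle be recomputed from the gene sequence. The sketch below is a minimal illustration in Python, assuming the sequence is pasted in the 10-base block layout used here, so whitespace is ignored. The function name gc_percent is hypothetical, and how the database itself rounds the value is not known from this record.

```python
# Hypothetical helper: percentage of G and C bases in a DNA string.
# Whitespace and any non-ACGT characters are ignored (an assumption).

def gc_percent(seq: str) -> float:
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 7 of 10 bases are G or C.
print(gc_percent("ATGGCCCAGC"))  # 70.0
```

Passing the full 1482 bp sequence to the same function gives the value to compare with the GC content field.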
 
Protein sequence
MAQPSAGETQ ASENITLSLL DRIIAEGRMA HDDSQQDYAR DMLAEFATQV LDEGMAIDKD 
TVSMINDRIA QIDALIGAQL DEILHHPELQ KLEASWRGLH MLVKNTETGA RLKLRLLNVT
QKELLIDLEK AVEFDQSALF KKIYEEEYGT FGGHPFSLLV GDYSFGRHPQ DIGLLEKLSN
VAAAAHAPFI AAASPRLFDM GSFTELAVPR DLAKIFESQE LIKWRAFRES EDSRYVSLVL
PHVLLRLPYG PDTCPVEGMD YVEDVNGRDH ARYLWGNAAW ALTQRITEAF ARYGWCAAIR
GVEGGGAVEG LPAHSFRTSS GDLSLKCPTE VAITDRREKE LDALGFIALC HKKNSDLAVF
FGSQTTNRPR VYNTNEANAN ARISAMLPYV LAASRFAHYL KVIMRDKVGS FMTRDNVQTY
LNNWIADYVL INDNAPQEIK AQYPLREARV DVSEVVGKPG VYRATVFLRP HFQLEELTAS
IRLVATLPPP AAA
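
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The Python sketch below is one way to reproduce that. It builds the codon table from the NCBI base order TCAG. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene starts with ATG. The names CODON_TABLE and translate are hypothetical, and the function raises KeyError on codons containing characters other than A, C, G, T.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGCCCAGC CGTCCGCCGG CGAGACCCAA"))  # MAQPSAGETQ
```

The last four codons of the gene sequence, GCC GCG GCC TGA, translate to AAA followed by a stop, which matches the end of the protein sequence.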