Gene Avin_34920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_34920 
Symbol 
ID7762387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3561944 
End bp3563428 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content67% 
IMG OID643806358 
Producthypothetical protein 
Protein accessionYP_002800616 
Protein GI226945543 
COG category[S] Function unknown 
COG ID[COG2326] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.195205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGAAT CCGCCGAGAT CGGCCACCAG ATCGACAAGG AAAGTTACGA AGCGGCCGTG 
CCGGCGCTGC GCGAGGCACT GCTCGAGGCC CAGTACGAAC TCCGCCAGCA GGCACGTTTC
CCGGTGTTGG TGCTGATCAG CGGTATCGAG GGCGCCGGCA AGGGCGAGAC GGTCAAGCTG
CTCAACGAGT GGATGGACCC GCGGCTGATC GAAGTCAGCA CGTTCGACCA GCAGACCGAC
GAGGAACTGG CGCGCCCACC GGTCTGGCGC TACTGGCGTC AGTTGCCGCC GAAGGGGCGG
ATCGGCATCT TCTTCGGCAA CTGGTACAGC CAGATGTTGC AGGCGCGGGT GCACGAGCGG
ATCGACGATG CCCGCCTCGA CCAGGCCATC GACGGCGCCG AGCGCCTGGA GCGGATGCTC
AGCGACGAAG GTGCGCTGAT CTTCAAGTTC TGGTTCCACC TTTCCAAGAA GCGCATGAAG
GAGCGCCTGG CGCTTCTCAA GGACGATCCC CTGCACAGTT GGCGGCTGAG TCCGCTGGAC
TGGCAGCAGT CGAAGACCTA CGGCAAGTTC GTGCGCTACG GCGAGCGGGT GCTGCGGCGC
AGCAGCCGGG ACTTCGCGCC CTGGTACGTG ATCGAGGGCT CCGATGCCAA TTACCGCAGC
CTGAGCGTCG GGCGCATTCT CCTCGACGGC CTGCAGGCGG CCCTCGGGCA CCGGGGCCGG
CCGGCCCACC GCCCGCACGC GGCGCCGCTG GTGTCCAGCG TGGACAACCG TGCCCTGCTG
GACTCCCTGG ACATGACCCA GGCCCTCGCC AAGCCGGATT ACCAGCGCCT GCTGATCGCC
GAGCAGGCGC GCCTGGCCCT GCTGATGCGC GACAAACGCA TACGCCGGCA TGCCCTGGTG
GCGGTGTTCG AGGGCAACGA CGCGGCCGGC AAGGGCAGCT CTATCCGCCG TGTCGCCGCC
GCCCTGGACC CGCGCCAGTA CCGGATAGCG CAGATCGCCG CGCCGACCGA GGAGGAGCGC
GCCCAGCCCT ACCTCTGGCG TTTCTGGCGG CATATTCCGC CGCGCGGCAA GTTCACCATC
TTCGACCGCT CCTGGTACGG ACGTGTGCTG GTGGAGCGGG TCGAGCGCCT GTGCAGCGAG
GCCGACTGGC TGCGCGCCTA CGGCGAAATC AACGATTTCG AGGAGCAGTT GAACGATGCC
GGGGTGGTGC TGGTCAAGTT CTGGCTGGCC ATCGACCGGG AGACCCAACT GGTGCGCTTC
AAGGAGCGCG AAGCGACACC CTTCAAGCGC TTCAAGATCA CCGAGGAAGA CTGGCGCAAC
CGCGACAAGT GGGAGGACTA CAGCGACGCG GTGGGCGACA TGGTCGACCG CACCAGCAGC
GAGATCGCCC CCTGGACCCT GGTCGAGGCC AACGACAAGC GCTTCGCCCG GGTGAAGATC
CTGCGCACCC TCAACGACGC GCTGGAGAAG GCCTTGCGCG GCTGA
 
Protein sequence
MFESAEIGHQ IDKESYEAAV PALREALLEA QYELRQQARF PVLVLISGIE GAGKGETVKL 
LNEWMDPRLI EVSTFDQQTD EELARPPVWR YWRQLPPKGR IGIFFGNWYS QMLQARVHER
IDDARLDQAI DGAERLERML SDEGALIFKF WFHLSKKRMK ERLALLKDDP LHSWRLSPLD
WQQSKTYGKF VRYGERVLRR SSRDFAPWYV IEGSDANYRS LSVGRILLDG LQAALGHRGR
PAHRPHAAPL VSSVDNRALL DSLDMTQALA KPDYQRLLIA EQARLALLMR DKRIRRHALV
AVFEGNDAAG KGSSIRRVAA ALDPRQYRIA QIAAPTEEER AQPYLWRFWR HIPPRGKFTI
FDRSWYGRVL VERVERLCSE ADWLRAYGEI NDFEEQLNDA GVVLVKFWLA IDRETQLVRF
KEREATPFKR FKITEEDWRN RDKWEDYSDA VGDMVDRTSS EIAPWTLVEA NDKRFARVKI
LRTLNDALEK ALRG