Gene Avin_52120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_52120 
Symbol 
ID7764049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5324279 
End bp5326285 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content63% 
IMG OID643808028 
ProductTonB-dependent receptor 
Protein accessionYP_002802262 
Protein GI226947189 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.618776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCACC CCGCATCTCG CCTGCCCCTT TTGGCGGGCG GCCTCTGTCT TTGCCTGAAC 
GGTCTGTCCC AGGCCGATGA CAATGTCTTC GAACTCGGCC AGATCTTCGT GCTGGGCAGC
CATGGCGACG CCAGCCGCCT CGACAACACG GACTCGGTGG ACAGCCGGGA CATGCGCCTG
CACGACCGCG AAACCGTCGG CGAGGCGCTC AACCTGATTC CCGGGGTCAG TCTCAGCAAG
ACCGGCGCTC GCAACGAGCA GATGGTCTAC GTGCGTGGCT TCGATCTGCG CCAGGTGCCC
GTCTTCATCG ACGGCATCCC GGTGTACGTG CCCTACGACG GCTATCCCGA TCTCGGCCGC
TTCACCACCT TCGAACTATC CAGGATCGAG GTCTCCAAGG GCTTCAGCTC CATGACCTAC
GGCCCGAACA CCCTGGGCGG GGCGATCAAC CTGGTGACCC GGCGGCCGCA GGAGGAATTC
GAAGGCGAGA TCGGCACCGG TTTCAGCTTC ACCGACCGAG CGGAGAACAA CGGCGAATGG
ACATACGCCA ACTTCGGCAC CCGGCAGGAC AACTGGTGGG CGCAGATGGG TCTGTCCTAC
CTCAACGAGG ATTATTTCCG CCTGCCCGGC AACTTTGACG ACGAACGCCT GGAGGATGGC
GGGCGGCGCG GCAACAGCGA CCGCAAGGAC AAGAAGATCA ACTTCAAGGT CGGCTTCACG
CCCAACGACA CCGACGAATA CGTGCTGGGC TACGTCAAGC AGGACGGCGA AAAGGGCAAC
CCCGCCTACG CCGGACACCT GGGCGGCAGC TACAACCGCT TCTGGCGCTG GCCCAAGTGG
GACAAGACCA GCTACTACAT CAACACGGTC ACCGCTTTCG GCGATCACAA GGTCAAGGCC
CGCCTCTACC ACGACATCTA CAAGAACGAC CTGTACTCCT ACGACAACGA CAGCTACAGC
ACGCAGAAAA CCTCCCGGGC GTTTCGCAGC TACTACGACG ACTACAGCAC CGGTCTCAGC
CTCGAGGACG AGTGGACCCT GGACGAAAAC AACCAGTTGC GCCTGGCCTT CCACCACAAG
CAGGACGTAC ACCGCGAGCA CGACGCCGGC GAGCCCAAGC AGCGTTTCGA GGACGAAACC
CAGTCCCTCG CCCTGGAGTA CACCCGCAAG CTGACCGAAC GTCTCACCCT GATCGCGGGT
TTCTCCCATG ACCGGCGCAA CGGCCGCGAA GCCGAGACCT ACACCAGCGC CGCCGGCCTG
TTCGAGGAGG AAGGCGGGCG CGAATCCACC AACAACGGCC AGCTCGGGCT GTTCTTCCAG
GCCGACCCGC GGACCCTGTG GCGCTTCACC GTGGCGCGCA AGAGCCGCTT CCCGACCATG
AAGGATCGCT ACTCCTACCG TTTCGGCAGC GCCCTGCCCA ACCCCGACCT GAAAACCGAG
GAAGCCACCC ATTTCGAGAT CGGCTACAAG CGTGCCCTGA GCGACAGCCT GGAACTGGAT
CTGGCGCTGT TCCGCAGCCA CGTGGACGAC CTGCTGCAAT CGGTGCGCGT GGCCGGCAGC
GCCTGCTCCA ACCCGCCCTG CTCGCAGATG CAGAATGTCA GCGAGGCGCG CATGAACGGC
GTGGAAGCCT CGCTGAACGG CAACCTGGGC GCCTGGGAAG TGAACTTCAA CTACCTCTAC
CTCAACCGCC AGAACCGCTC CTCCGACGAC TCCATACGGC TCACCGACAC GCCCAAGCAC
AAGGGCTTCC TCAGTATCGG CCGGTGGTTC GGCCCATGGC ACCTGATGGC CAGCTCGGAC
GCTTCCTCCA GCCGCTACAG CTCCACCGAC GGCACCCAGG AAGCTTCCGG ATTCGTGGTC
TTCAACCTGA AGGGCGGCTA TCGCTTCGAC AACGGCCTGC AACTGGACGC CAGCGTGCAG
AACCTCACCG ACCACGAGTA CGAATACAGC GAAGGCTATC CCGAACCGGG CCGCACCTTC
GTCGTGCAGG CCAACCTGAC TTTCTGA
 
Protein sequence
MQHPASRLPL LAGGLCLCLN GLSQADDNVF ELGQIFVLGS HGDASRLDNT DSVDSRDMRL 
HDRETVGEAL NLIPGVSLSK TGARNEQMVY VRGFDLRQVP VFIDGIPVYV PYDGYPDLGR
FTTFELSRIE VSKGFSSMTY GPNTLGGAIN LVTRRPQEEF EGEIGTGFSF TDRAENNGEW
TYANFGTRQD NWWAQMGLSY LNEDYFRLPG NFDDERLEDG GRRGNSDRKD KKINFKVGFT
PNDTDEYVLG YVKQDGEKGN PAYAGHLGGS YNRFWRWPKW DKTSYYINTV TAFGDHKVKA
RLYHDIYKND LYSYDNDSYS TQKTSRAFRS YYDDYSTGLS LEDEWTLDEN NQLRLAFHHK
QDVHREHDAG EPKQRFEDET QSLALEYTRK LTERLTLIAG FSHDRRNGRE AETYTSAAGL
FEEEGGREST NNGQLGLFFQ ADPRTLWRFT VARKSRFPTM KDRYSYRFGS ALPNPDLKTE
EATHFEIGYK RALSDSLELD LALFRSHVDD LLQSVRVAGS ACSNPPCSQM QNVSEARMNG
VEASLNGNLG AWEVNFNYLY LNRQNRSSDD SIRLTDTPKH KGFLSIGRWF GPWHLMASSD
ASSSRYSSTD GTQEASGFVV FNLKGGYRFD NGLQLDASVQ NLTDHEYEYS EGYPEPGRTF
VVQANLTF