Gene Ava_4967 details


Gene Information

Locus tag: Ava_4967
Symbol:
ID: 3679076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6247033
End bp: 6249129
Gene Length: 2097 bp
Protein Length: 698 aa
Translation table: 11
GC content: 39%
IMG OID: 637720325
Product: TonB-dependent receptor
Protein accession: YP_325459
Protein GI: 75911163
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
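
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon (the function name `cds_lengths` is illustrative, not part of the source database):

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal
    stop codon that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp copied from the record.
gene_length, protein_length = cds_lengths(6247033, 6249129)
print(gene_length, protein_length)  # 2097 698
```

Both values agree with the Gene Length and Protein Length fields of the record.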


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.655024
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAAAGA GCTTTTTTTG GCTGCCCGTT GCTTTTCCTA GTTTATTTTT AGCATTTCCG 
GCTTTGGGTA GTGATACGGA AATTACTCAA ACAGTAGATA ATTCTGATAT TCCGTATTTG
AGTGAAATTG AGTTACCCAC GACCAATGCT GAATTATTAA CTCAATCGAC ACCAGGAGAA
TTAAATTCAG ACAGTGAACC TAAGGAACAG CCAGAAATTG AGGAAACTTC TAGTGATGAT
GCAGACATTA CCATAGAAGC GATCGCCGAA CCAGAAACTC TGCCAGTATC TACTCCCACT
TACGTTATTG ACCAAGAAGA AATTCAAAAA CAAGGCGCTA CCAGTGTCGC TGATGTATTA
AAAAGAATGC CTGGTTTTGC TATTAATGAT GTTGGTCATG GTGCAGATAT ACACACGGGT
ACTTATTACC GGGGAGCCTC AATTAATCAG TCTGTATTCC TCATCAATGG CAGACCAATT
AACAATGATG TCAACACATA TCATGGTGCA ACTGACCTTA ATAGTATTCC TGTAGAATCT
ATTGAGCGTG TCGAATTATC TAGCGGCGTG ACCTCTGCTT TATATGGTTC CTCTGCTTTT
GGGGGAGTGG TGAATATCAT CACCAAAAAA GGTTTTGCAC AACCTAAATT GACGAGTAGT
TTAGAATTTG GCTCACTCAA TCTGAATAAT CAACAGTTTA GTTATAGTGG TTCAGTTGGT
GCGGCAAATT ATAATTTTAG CTTTGAGAGA TATTTTGTTG ATAACCGTTA TCGTGTGCCT
GTAGGTGCAG CCAATCGTGA CTCCCAAGGG TTTTTATCGA ATGCAGATAC CTCTACTAGC
ACCTACTTTG GCAATATTGG CTTAGATTTA GATCAAAGAA ACTCGTTAAG TTTAGATATT
ACTAAACTCA GTAGTCGTAG AGGCTTAGTT TATTTCGGAT TTCCCCTACA AAGAGATAGA
TTAGACCATG ATGGTTTAAA TATTGGTTTA TCTTGGAAAA CTCGCCTTGG GAATGGCGAT
AACTCCAACC TGACCACTAC CTTTGGTTAT AACCAAAATT ATTTCAGCAC CTATGGCCCT
ACAGTTTTTG CAGGTAGAGA ATTTTATCGC ACTGGTGTTT TAGATACACA ACAATTCACT
GGTAGGATTG ATCATGATTG GAAAATTTCC CCAAATAATA AATTGCGTTG GGGACTAGAT
TTAAAAAATA CTGATTTAAG TGGTGATGTT TTAAGCTCTA GTCCGAATCG GACGGCTTTT
AACGAATCTG AAAATAGAAA TGTATTAAAT ACAGCTTTAT TTGCTGTGAA TACATGGAAT
CTGAGCAATA GTTTTCAGTT AGATTTAGGA CTGAGACAAA GTTTTGATGG TCAGTTTGGC
AATTATCTTA ACCCTAGTGT GGGCTTACGT TACGACATTA CACCATTAAT TGCTATGCGT
GGTAGTTGGG CTGGAGGACA GCGCAACCCA GGTTTAGACC AGTTGTACGT TTATGATACA
GTTCATGGTT GGGAACCAAA CCCCGATTTA GAGCCGGAAA CAGGTTCCTC TTGGACTGCG
GGGGTAGATG TCAAGCTTGC TGATAATTTA ACAGGACAGT TTACCTATTT TGGCAGCAGT
CTGAATAATC GCTTAGGAAT TGTGGCTGGA AGATGGGCAA ATATTGGCTT AGTCGATACC
AATGGTTTAG AAGCCGCACT GCAATTAAAA GTTGCTAATA ACTGGTCAAC TTTTCTCAAC
TACACTTATA CAGATGCCCA AATTAAGACA GGTTCAGAAA GAGGTTTACA GTTGGGGATG
ATTCCCTATT CTGTATTGCA AACTGGTGTA GGTTATCAAA ACTCAGGTTG GCAGGCTAAT
TTGTATGTTA CCTACAATAG TGGCGCTCGT AGAGCCTTTT TCAACCGACC AGGTGAGACA
ACTACAGATT TCGTCTCATC TTTTGTCAAT TTAGATTTGA GTGGTCGCAT CCCCTTAACT
CGTACTTTAG GACTAACAGT TTACTTAGAA AATTTACTAG GTGAACAATA TGAGCGAGTG
AATCGTATTT ATAGTCCTGG GTTTACTTTT CGCTTGGGTT TAAGTTCTAG TATTTAA
 
Protein sequence
MKKSFFWLPV AFPSLFLAFP ALGSDTEITQ TVDNSDIPYL SEIELPTTNA ELLTQSTPGE 
LNSDSEPKEQ PEIEETSSDD ADITIEAIAE PETLPVSTPT YVIDQEEIQK QGATSVADVL
KRMPGFAIND VGHGADIHTG TYYRGASINQ SVFLINGRPI NNDVNTYHGA TDLNSIPVES
IERVELSSGV TSALYGSSAF GGVVNIITKK GFAQPKLTSS LEFGSLNLNN QQFSYSGSVG
AANYNFSFER YFVDNRYRVP VGAANRDSQG FLSNADTSTS TYFGNIGLDL DQRNSLSLDI
TKLSSRRGLV YFGFPLQRDR LDHDGLNIGL SWKTRLGNGD NSNLTTTFGY NQNYFSTYGP
TVFAGREFYR TGVLDTQQFT GRIDHDWKIS PNNKLRWGLD LKNTDLSGDV LSSSPNRTAF
NESENRNVLN TALFAVNTWN LSNSFQLDLG LRQSFDGQFG NYLNPSVGLR YDITPLIAMR
GSWAGGQRNP GLDQLYVYDT VHGWEPNPDL EPETGSSWTA GVDVKLADNL TGQFTYFGSS
LNNRLGIVAG RWANIGLVDT NGLEAALQLK VANNWSTFLN YTYTDAQIKT GSERGLQLGM
IPYSVLQTGV GYQNSGWQAN LYVTYNSGAR RAFFNRPGET TTDFVSSFVN LDLSGRIPLT
RTLGLTVYLE NLLGEQYERV NRIYSPGFTF RLGLSSSI
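
The protein sequence can be related back to the gene sequence by translating it with translation table 11, as listed in the record. The sketch below is an illustration only, not the pipeline that produced this page. It assumes the amino acid assignments of table 11 match the standard genetic code, that the alternative start codons of table 11 (such as the `GTG` that opens this gene) are read as methionine, and that GC content is the fraction of G and C bases. The names `translate_cds` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11 (assumed here to be
# translated as methionine when they begin the coding sequence).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases in a sequence."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGAAAAAGA GCTTTTTTTG GCTGCCCGTT GCTTTTCCTA GTTTATTTTT AGCATTTCCG"
print(translate_cds(first_line))  # MKKSFFWLPVAFPSLFLAFP
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.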