Gene Ava_4140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4140 
Symbol 
ID3681216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5161596 
End bp5163398 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content43% 
IMG OID637719486 
ProductS-layer region-like 
Protein accessionYP_324634 
Protein GI75910338 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0020254 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0305803 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAA TTTTATGGAA TTGTGGGCTA GTTGTCCCAG CCCTTTTTAA TGTGCTTTTC 
ATTCTTTCTT CAGGTGCAAT GGCAGAAGCC CCACCGCAGG CGATCGCACC GGAAACACTG
AATCTATCTG AATCAAAATC AGGGATTGAG GAAACTTCGT CAAATAGTGT AACTCAGGCT
GAATCAAATC AGATTGCCAA TGTAGAGGTA ATTGGCGCGC CTGATTACTT ACTATCTCAA
ACAGAAAATC TATTTTCTCA AGAAGATATC ACAGATGAAG AAAACCCCAT AGAACAAGTT
ACATCTGTTT CTCAGTTATC TGATATTCAG CCTACAGATT GGGCATTTCA AGCTTTACAA
TCCCTGGTGG AACGCTATGG AGCGATCGCA GGTTATACAG ATGGTACATT CAAAGGCGAT
CGCGCTCTTA CCCGCTATGA ATTTGCGGCT GGTCTAAATA CTGCTTTAGA CCGGTTCAAT
CAAATCATTG CCAATTCTAC AGCAGACTTA GTTAGACAAG AAGACTTAGA CACAATCAAG
AAGTTAAGAG AAGAATTTTC TACGGAATTG GCTGCACTGC GAGGAAGACT TGATACAGTA
GATGCAAAGT TAGAAACCAT CGAGAAACAA CAATTTTCTA CTACAGTTAA ACTCACAGGT
CGAGCGCAGA TAGTTATTGG CTCACTTTTT GCTGGTAATA ACGTTATCAC TGGGCGGCCA
GCACCCCGCG TAGTAACACA GCAAGGATCA GTGTCTTTGC GATTAAATGC CAGCTTTACA
GGTAAAGATT CACTGGGTAT AACGCTGGGG GGAGGAAATA TTCAATCATT AGGACAAACA
AGAGCTGGAT TATTAGGCAC TTTTGATGGC AGAACTGCTG ATAACTCCAG TATTACCAGA
CCACCCAATG ATATTTCTGT TAGTGGTGTA CGTTATCGGT TTCCTTTTGG TTCAAATACC
CAAGTCAACA TTTATGCTTT ATCCGATGGA GCTAATGAGC TAGGTTTTAC CGTTCCGATT
AATCCATACT TTGAAAGTAG TCTAGCAACT GGTTCTAATG GGATTTCCCG ATTCTCACGA
CGAGCTTTAG TCTATCAATA TGGAGATGCT GGCGGTGGAA TAGCAGTACT CCACAGATTA
AATCAACAGT TCCAATTGGG AGTAGCTTAT AGCGCACCTA ACGCCAATAA CCCTGGCCCC
AATACCGGCT TCTTCACAGG CCGATATTTA GCTTTAGGAC AGATACTATA CACCAGTCCT
CAGAGGAATT TTCGGGCGGC TCTAACTTAC GTTAATACTT ATAGTCCACC AAACGCCCAA
GGTTTAAGTG GAACAAACTT TGGCCCAGCA GCAGGAAGTA ACTTGGTCAA TAGCACCGTA
GCAGGAACGG GGACAGTAGC AAATCTTTAC GGCGTACAAG CTTTTTATCA ATTTAGTCCC
AAGTTTGCTA TGAATGGTTG GGTAAGTTAT GGCGCACACC GCTATTTAGG ACGCGGTGAT
GGCCGAGCTA TGGATTGGGC TGTAGGAATG TCGTTCCCGG ATCTTGGAAA AAAGGGAAGT
CTAGGGGGAT TGTTTGTGGG TATGGCTCCA ACACTGATCA GTCTTGGCAA AAATGTGAAT
TTGGGAGCAG GCTTAGGACA AGCAGACAAA GACCTTTCCC TACATATTGA AGGATTCTAC
CAATACAAAA TTAACGATAA AATCGACATT ACACCAGGTT TTATTTGGGT TACAGCGCCA
GATTCCAATG CCAACAATCC TGATAGTGTA TATGCTTGGA TTCGTACTAC CTATAGGTTT
TAG
 
Protein sequence
MFKILWNCGL VVPALFNVLF ILSSGAMAEA PPQAIAPETL NLSESKSGIE ETSSNSVTQA 
ESNQIANVEV IGAPDYLLSQ TENLFSQEDI TDEENPIEQV TSVSQLSDIQ PTDWAFQALQ
SLVERYGAIA GYTDGTFKGD RALTRYEFAA GLNTALDRFN QIIANSTADL VRQEDLDTIK
KLREEFSTEL AALRGRLDTV DAKLETIEKQ QFSTTVKLTG RAQIVIGSLF AGNNVITGRP
APRVVTQQGS VSLRLNASFT GKDSLGITLG GGNIQSLGQT RAGLLGTFDG RTADNSSITR
PPNDISVSGV RYRFPFGSNT QVNIYALSDG ANELGFTVPI NPYFESSLAT GSNGISRFSR
RALVYQYGDA GGGIAVLHRL NQQFQLGVAY SAPNANNPGP NTGFFTGRYL ALGQILYTSP
QRNFRAALTY VNTYSPPNAQ GLSGTNFGPA AGSNLVNSTV AGTGTVANLY GVQAFYQFSP
KFAMNGWVSY GAHRYLGRGD GRAMDWAVGM SFPDLGKKGS LGGLFVGMAP TLISLGKNVN
LGAGLGQADK DLSLHIEGFY QYKINDKIDI TPGFIWVTAP DSNANNPDSV YAWIRTTYRF