Gene Ava_5004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_5004 
Symbol 
ID3679056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6289793 
End bp6291613 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content45% 
IMG OID637720364 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_325496 
Protein GI75911200 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00831315 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.753431 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCCG ATCACGATTT TGATCAAGCT TCTCAAGAGC AGAATGAAAC AACAAATCCT 
CAATCTCGTA GTAGTGGAAA AAGACGTGGT AGACTTTTTG GGCGAGACGG TGTTAGTAGA
CGTTTATTTC TTGGTCACGC TGGTTTGTTT ACTGCTACTA GCGTTGTGAC TGGGATGATC
GGTTCGTCTT TCTCAGGTTC CAAACAGGGA GATATTGCAC AAGCTAGAGA AATTGCTCAG
GGTAGATATC GTGTTCGAGA TTTTAACTAT ATAAAATTTC GTCAGCAAGC CTTTCAAGTC
CGTCTCCAAG CAGCGCAAAA TAATCGGCAA ATTGATATCC CACCCCATCC TACCAATGGC
GATGAGGAAC GCTATGCCAA CAAAATTGCT ACAGACACGA GGGGATTACC ACACGATCAA
AGAGGCGAAG TTGATCTGCA AGCTTATGAT TCTTTGATTA GGGCATTAAC AACACAGAAC
CCAGATGATT ACGAAAGGAT TATTTTGGGT GGTACAAGGA AGTTAGTGAA TCCTCAAGGG
CCGTTGGCAA TTAGTTTAGA AGGCATCAAT GCTTCTCAGA TAGCAGTTCC ACCGCCACCA
ACCCTAAATA GTGCAGAACA AGCGGCAGAG GCGATTGAAC TCTACTGGCA GGCTTTATTA
AGGGATGTGC CTTTCAGTCG ATTTGCTCAA GATCCGAATG TCGCCGCAGC GATCGCCGAA
CTTAATAGCC TACCGGAGTT CCGGGGGCCA AAACAGAACG GTTTTGTCAC TCCGCAAACT
TTATTTCGTG GTAGCGTTAT CTATGTTGAT CAGCGCGATC GCTCAGGTAG GACAACAAAA
TACGTTACTC CTCCAGGAGT GCTAGATGGC CCCCACATTT CCCAATTTCT GTTACGGGAA
ATTCCTTACA ATACTCAATT CATTTCACCC TTGATTCGTA CTGCTCTAGC TGGTAAGGAG
AATGATTTTC TCACTAATTA CAATGAATGG CTAACTGTCC AGAATGGAGG TAGTTCTGGC
AAGTCCATTA AGTTCGATCC CACACGCCGT TTTGTTTTTA CTGTTCGTGA TCTGAGCGAA
CTATCACGTA TTGGTGGGGC TTTGTTCTTT GGAGCTTTGT TAATTCTCAA TAGCATTAGC
GCACCGTTAA ATCCAGGAAA TCCCTATATC AACTCCAAAA CTCAAGTTGG TTCTGCTGCT
ACCTTTGCAT CGGCACATTT TCAAGCCTTA CTCAACTTAG CTCCCTCGCG GGCAATCAGA
GCTTCTTATT GGCAAAAGTT TTACGTACAT CGACGTTTAC GACCAGAAGC TTATGGTGGA
TTGGTTTATA ACAACATTGT CAATGGAACT AGTTATCCGA TTAATTCCCA AGTCTTCAAT
TCCACAGCCT TAGCTCGCAC CTTCAGCACT TTTGGTACTT ATTTGTTACC CCATGCGTAC
CCAGAAGGCG CACCATTCCA CTCTTCTTAC ACTGGTGGTG CTGCTTCGAT TGCGGGTGTG
CAAGCTACGT TGTTGAAAGC CTTTTTTAAT GAGAATTTTG TGATTCCCAA TCCTGTAGAA
CCTGATCCCA ATGACCCCAC CAAACTAATT CCCTACAGTG GCCCAGCTTT GACAGTCGGT
GGTGAGTTGA ATAAACTGGC GACGAACTAT TACATCGGTC GTGGTCATGG CGGTATTCAT
TGGCGTTCCG ATGGTGCAGC TGGCTTAGCG TTAGGTGAGG AAGTTGCTAT CAGTATTCTT
AGAGATGAAA GACTGGGATA CAACGAACGG TTTAATGGTT TTACCTTCAC CAAATTTGAC
GGTACAAGAG TGACTGTCTA A
 
Protein sequence
MQPDHDFDQA SQEQNETTNP QSRSSGKRRG RLFGRDGVSR RLFLGHAGLF TATSVVTGMI 
GSSFSGSKQG DIAQAREIAQ GRYRVRDFNY IKFRQQAFQV RLQAAQNNRQ IDIPPHPTNG
DEERYANKIA TDTRGLPHDQ RGEVDLQAYD SLIRALTTQN PDDYERIILG GTRKLVNPQG
PLAISLEGIN ASQIAVPPPP TLNSAEQAAE AIELYWQALL RDVPFSRFAQ DPNVAAAIAE
LNSLPEFRGP KQNGFVTPQT LFRGSVIYVD QRDRSGRTTK YVTPPGVLDG PHISQFLLRE
IPYNTQFISP LIRTALAGKE NDFLTNYNEW LTVQNGGSSG KSIKFDPTRR FVFTVRDLSE
LSRIGGALFF GALLILNSIS APLNPGNPYI NSKTQVGSAA TFASAHFQAL LNLAPSRAIR
ASYWQKFYVH RRLRPEAYGG LVYNNIVNGT SYPINSQVFN STALARTFST FGTYLLPHAY
PEGAPFHSSY TGGAASIAGV QATLLKAFFN ENFVIPNPVE PDPNDPTKLI PYSGPALTVG
GELNKLATNY YIGRGHGGIH WRSDGAAGLA LGEEVAISIL RDERLGYNER FNGFTFTKFD
GTRVTV