Gene Ava_3483 details


Gene Information

Locus tag: Ava_3483
Symbol:
ID: 3679795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 4317292
End bp: 4320456
Gene Length: 3165 bp
Protein Length: 1054 aa
Translation table: 11
GC content: 44%
IMG OID: 637718835
Product: TPR repeat-containing protein
Protein accession: YP_323985
Protein GI: 75909689
COG category: [S] Function unknown
COG ID: [COG1511] Predicted membrane protein
TIGRFAM ID:
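
The coordinate and length fields above are consistent with one another, and a short check makes the relationship explicit. The sketch below assumes 1-based, inclusive coordinates (which the reported values imply, since 4320456 - 4317292 + 1 = 3165) and assumes the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields above.
START_BP, END_BP = 4317292, 4320456
gene_length = span_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 3165 1054
```

Both results match the reported Gene Length (3165 bp) and Protein Length (1054 aa).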


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.423026
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCTG GCATGGGTAT TAATTCCAGA AAGCGATCGC TCGTTGATGG CAATCGGGCG 
TTAAATTTAT TTACAGACCG TCATGAATTA ACTCGTGTCT TTGCTGCGTA TTTACATGAC
GAGCCAGCAG AGAAAATATT ATCTTTTTCT GGCGATGGTG GTAACGGTAA ATCTTTACTA
TTGAAGTTTT TACGCACCAA GTGTTGTAAG CGGTTTGGTG CTGATGCTTG GCAGAAATTA
AAGACGAAAA CGGCGGCGGA AATCGCAGAT TATATAGAGT CTGCGGATAA TGATCAGTGT
GACCTAGTAC CAGCGATTTT ACAGGACTTT GGGCTACAAC CGAATGGCGA TGACCAACCC
CAAGACCCAT TTTATGGGTT GTTGATGCTT AGGCGATCGC TTTCACGGGC GGCGACAGAA
TTAGGATATC GACTGCGGTT TCCTTTGTAT GACTTTGCCT GTGTTTGGTA TCTCAAACAA
AAAAATCGCC TCACACGGGA GAAATTAGCA GAACTGTTTC CCTCCGAAGA AATGGACTTG
CTGATTGAAA TCGTCAATGC AGTCAGCGAC ACCTCTTGGG GAACTATCGG TAAAGCTGTT
TTCGGAATTT TTAATAAGCA TTTGGGAGAA AACCTGCTTT TACATTGGCA GAAGCGAGGA
CTTAAGAAAG AGGATATAGA GGAAATTCGG GGGATGGATG CGGAAACCGA ATTGATGAAC
GAACTACCGC GCTATCTGGC TCAAGATTTG AGTGCGGCGA TGTCCCAAGA GAAAGCACCA
CCAAGAATTG TTTTGTTTTT TGATACTCAT GAAGCTTTTT GGGGTGGGCA ACGCCAACAA
ACGGGCATAC TATACTTTCA ACGGGATGAA TGGCTGCGGT ATTTTTTAGC AGAGTTGGAC
TTGAAAGCGG GGATAGTCGC CGTCATTGCG GGAAGAGAAA CACCTCGTTG GGCGCAAGCT
GATAATTTTC AGATTCCGCA AAAATATATT GATATTCAGT TAGTCAATCA TCTTTCATCG
GCTGATGCAG ATGTGTATTT GCAACGTGCT GAAATTGGGG ATCAGGCTTT GCGGCAAAGT
GCGATCGCCT ACTCTAGCGT CACAGCCAAT CAAGTACATC CCTTGCTGTT GGGTTTGTCT
GCGGATGTGA TATTACAGGC GCAAGAACAC CTCACACCAG AAGATTTTCC CAAGCAAGAG
GCGACATTAA ATAAAGCCAA ATACCTCATG AACCTGCTGC TGAAATATAC AGATAGAGAA
TTTGGTTATG CTGTTCATGC TTTAAGTGCT TGTCGCTCAT TTAACTTTGA AATTTACCGT
CTACTAGCAG AGGAATTGCA TTTTTCTACC ACCAAACCAG CCTTCGACAT CCTCACAGAA
TTTTCCTTTG TTTGGGATGT GGAAAAGTTG GGTGAAAATT GGTATCGCAT CCACGACTTG
CTACGGCGCT TAAACTATGA AAACAGTAAT GAAATTACTC AACAGGCTCA TGTTGTTTTA
GAAAAACACT ACCGCCAACA GGGACAAGTC GCAGAAGCAA TTTATCACGC TAACCGCTTA
GACTGGCGGC GGGGTGTGGA TGAATGGGAA GAAGTGTTTG AGCAAGCCTT GGAGTTGAGT
CGTTATGCAC AATGTCGTTC ACTGTTGGAA GTTAGGAGTG AGTTGGTAAT TAACAGCGAT
TTTCAAATCG GTAGAGTGTC CCAATCTGAA GGTGATTACT TTGCTCAATT AGCTAAATAT
CAAGAAGCGC AAACAGAATA TTTAGAGGCT GTAGCTGCAT ACAACCGAGA ATTAAGCATT
ACCCCCGATG ATACCGCCAC TCTCAACAAC AAAGGGTTAG CGTTAGAAAG TTTAGGGAAT
TTGCAAACAC AACTAGCCCA GCATACTCAA GCCATACAAT CCTACACTAG TGCGATCGCT
GCCTATGACC AAGCCCTCAA TCTCGCTCCT AACTACACCC AAACTATCAA CAACAAAGGG
TTAGTCTTAA AAAATTTAGG GGATTTGCAA ACAAAACTCG CCCAGCATCC CCAAGCCATA
CAATCCTACA CTAGTGTGAT CGCTGCCTAC GACCAAGCCC TTAATCTCGC TCCTGACTAT
ATCAATGCTC TCAATAACAA AGGGGTAGCG TTACAAAGTT TAGGGAATTT GCAAACAAAA
CTCGCCCAGC ATCCCCAAGC CATACAATCC TACACTAGTG CGATCGCTGC CTATGACCAA
GCCCTCAATC TCGCTCCTGA CTATATCAAT GCTCTCAATA ACAAAGGGGT AGCGTTACAA
AGTTTAGGGA ATTTGCAAAC AAAACTCACC CAGCATACCC AAGCCATACA ATCCTACACT
AGTGCGATCA CCACCTACGA CCAAGCTCTT AATCTCGCTC CTGACGACAC TTATGCTCTC
AACAACAAAG GGAATGCGTT ACAAAGTTTA GGGAATTTGC AAACAAAACT CGCCCAGCAT
CCCCAAGCCA TACAATCCTA CACTAGTGCG ATCGCCACCT ACGATCAAGC TCTCAATCTC
GCTCCTGACG ACACTTATGC TCTCAACAAC AAAGGGTCAG TCTTAAAAAA TTTAGGGGAT
TTGCAAATAA AACTAACCCA GCATAGTGAA GCCATAGAAT CTTACACTAG TGCGATCGCC
GCCTACGACC AAGCCCTCAA TCTTGCTCCT AATTACACCT ATGCTCTCAA CAACAAAGGG
TTCGCGTTAC AAAGTTTAGG GAATTTGCAA ACAAAACTCG CCCAGCATAG TGAAGCCATA
GAATCTTACA CTAGTGCGAT CGCCGCCTAC GATCAAGCCC TCAATCTTGC TCCTAATTAC
ACCTATGCTC TCAACAACAA AGGGAATGCG TTAGCAAAAT TAGGGGATTT GCAAACAAAA
CTCGCCCAGC ATACCCAAGC CATACAATCC TACACTAGTG CGATCGCTGC CTATGACCAA
GCCCTCAATC TCGCTCCTCG CTACATCAAT GCTCTCAACA ACAAAGGGTT AGCGTTACAA
GGTTGGGGTA AATTACTTTT ACAGTTATCC CAAAAACCAG AAGCAGTCAA TCATTTACAA
GCAGCATTAG CCGTCTTTAA TAGCTCCTTA GCAATAGTTT CCGGTGATGA GAGTGTTCGC
AACTTAAGAG ATGAACTACA AGAATTTCTG GATAATTTAA CGTGA
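
The GC content field in the Gene Information section can be recomputed from the gene sequence once the display spacing is removed. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and how the source rounds the percentage or treats ambiguous bases is not stated, so small differences from the reported figure are possible.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration;
# the reported GC content refers to the full 3165 bp sequence.
first_line = "ATGTCTGCTG GCATGGGTAT TAATTCCAGA AAGCGATCGC TCGTTGATGG CAATCGGGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To check the reported value, pass the complete gene sequence text to `clean_sequence` and then to `gc_percent`.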
 
Protein sequence
MSAGMGINSR KRSLVDGNRA LNLFTDRHEL TRVFAAYLHD EPAEKILSFS GDGGNGKSLL 
LKFLRTKCCK RFGADAWQKL KTKTAAEIAD YIESADNDQC DLVPAILQDF GLQPNGDDQP
QDPFYGLLML RRSLSRAATE LGYRLRFPLY DFACVWYLKQ KNRLTREKLA ELFPSEEMDL
LIEIVNAVSD TSWGTIGKAV FGIFNKHLGE NLLLHWQKRG LKKEDIEEIR GMDAETELMN
ELPRYLAQDL SAAMSQEKAP PRIVLFFDTH EAFWGGQRQQ TGILYFQRDE WLRYFLAELD
LKAGIVAVIA GRETPRWAQA DNFQIPQKYI DIQLVNHLSS ADADVYLQRA EIGDQALRQS
AIAYSSVTAN QVHPLLLGLS ADVILQAQEH LTPEDFPKQE ATLNKAKYLM NLLLKYTDRE
FGYAVHALSA CRSFNFEIYR LLAEELHFST TKPAFDILTE FSFVWDVEKL GENWYRIHDL
LRRLNYENSN EITQQAHVVL EKHYRQQGQV AEAIYHANRL DWRRGVDEWE EVFEQALELS
RYAQCRSLLE VRSELVINSD FQIGRVSQSE GDYFAQLAKY QEAQTEYLEA VAAYNRELSI
TPDDTATLNN KGLALESLGN LQTQLAQHTQ AIQSYTSAIA AYDQALNLAP NYTQTINNKG
LVLKNLGDLQ TKLAQHPQAI QSYTSVIAAY DQALNLAPDY INALNNKGVA LQSLGNLQTK
LAQHPQAIQS YTSAIAAYDQ ALNLAPDYIN ALNNKGVALQ SLGNLQTKLT QHTQAIQSYT
SAITTYDQAL NLAPDDTYAL NNKGNALQSL GNLQTKLAQH PQAIQSYTSA IATYDQALNL
APDDTYALNN KGSVLKNLGD LQIKLTQHSE AIESYTSAIA AYDQALNLAP NYTYALNNKG
FALQSLGNLQ TKLAQHSEAI ESYTSAIAAY DQALNLAPNY TYALNNKGNA LAKLGDLQTK
LAQHTQAIQS YTSAIAAYDQ ALNLAPRYIN ALNNKGLALQ GWGKLLLQLS QKPEAVNHLQ
AALAVFNSSL AIVSGDESVR NLRDELQEFL DNLT