Gene Ava_3507 details


Gene Information

Locus tag: Ava_3507
Symbol:
ID: 3679609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 4352000
End bp: 4355023
Gene Length: 3024 bp
Protein Length: 1007 aa
Translation table: 11
GC content: 43%
IMG OID: 637718859
Product: protein prenyltransferase, alpha subunit
Protein accession: YP_324009
Protein GI: 75909713
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
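
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4352000, 4355023)
print(length)                     # 3024
print(protein_length_aa(length))  # 1007
```

Both results agree with the Gene Length and Protein Length fields of the record.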


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.223985
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAAGC AACTTTGGCA GTGGCTCAAG AGGTCTTTTG GGCGTTTATT TGGCAGAAAG 
CATTCCCCAG TCAGGGAACA GAACAAAGTA GAACCACCGC AACGGTTAAC GGATGCGGAG
TATGAATCGT TGTTTCTCCA GTTGTTAGCA GAAGTCAATG ATGGCTTGAC TAGAGGAGAA
GCAAAAGGTT TCTTGGCTGC AAAGCACATC AATGAAGGTG ATTTGGTGGA GTGGTTGCGG
GGTTTTGGCG AAAGATTGTT CGCTTCAGCT AAGCCAAATG ATGAATTGGT AAGTCGGATG
GTGCGGCTGG GTGAGTTGAG TATTGGGGAA GTTAGTGATG TTGCGGGTGA TATTGGGAGG
CGGTTGGGGG GAGGAGAAAC GAACCGCAGA GGCGCAGAGG ACGCGGAGGA AGAAGAAACA
AGTAATCAAT TCAATGATGC TATTGAATTG ACGAGTGACG AAGCAGAGGC TTGGTTAAAT
CAAGGTGTGG CACTGGCTAA TTTAGGGCAA TTAGAACAAG CAATCACATC TTTTGACAAA
GCTATAGAAT TCAAGCCTGA CGATGACTCA GCTTGGTACA GCCGGGGTGT GGCGCTGTGT
AATTTGGGGC GATTTGAACA AGCGATCGCT TCTTATAATA GGGCTATAGA ATTCAAACAT
AATTTTCCTG AAGCTTGGAC TAATCGTGGG GTAATCTTAA ATAGCCTCAA ATTATATCAA
GAAGCACTGA CCTCGTTTGA AACTGCTTTG CAAATCAATC CCAACTTTCC AGAAGTATTC
AATGCTTGGT ATGGTAGGGG TAACACACTA TTCAATTTAG AGAAATTTGA AGAAGCGATC
GCATCTTATG ACAAAGCCAT AGAATTCAAA GCTGACGACT ACTCAGCTTG GTACAACCGA
GGTGTGGCGC TGGATAATTT AGGGCAATTT GAAGAAGCGA TCGCATCTTA TGACAAAGCT
ATAGAATTCA AAGCTGACGA CTACTCAGCT TGGAATTACC GAGGTGTGGC GCTGGCTAAT
TTGGGGCGAT TTGAAGAAGC GATCGCATCT TATGACAAAG CCATAGAATT CAAAGCTGAC
GACTACTCAG CTTGGTACAA CCGAGGTGTG GCGCTGAGTA ATTTAGGGCG ATTTCAAGAA
GCGATCACAT CTTATGACAA AGCCATAGAA TTCAAAGCTG ACTTTTACAT AGCTTGGATG
AACCGAGGAA TTGTAGCTGG AAACGTAATA GTAGAGAGGA TAGATTTCTC TACATTTCCT
TTACCTCATG CGGCAGCACA TAACCTAGCT TTAAAATTTA ACAATCCAGA TTTAAATAAG
CGTGGCTATG AGGGAAGATT AGCCAGCTAT GAGGAAGGAT TAAAACATTG TCAGCAAGAA
ACTCACCCAG AAGGTTGGGG AAAATTGCAT CGGGCGATAG GTGATTCTCA TTATTACCAA
GGGCGAGGTA ATTATAACAC TCGCTATTTT TGGCGCAAAG CTATTAACAG TTACAAAACG
GCACTGCAAA CTCTCACAGC AACAAATTTT CCTGAGTTGC ATCTGGAAGT TTTGCAAGAT
TTAATTCGCG TGCTGTTAGA TTTGGGAGAA ATAGCCGAAG CTACAGAACT CCAGCGCCAA
GGTACTGAGT TGTTGCGGCG CTTATTAAAT GAACCAAATC GCTCTGAGCG AAGTAAAAAA
CAACTAGCTT TAAAATTTGC TTGGATTCAG CAGTTAACTG TTGATTTAGC CGTGCAGTCT
GGAGATTTGC TGCAAGCGAT TGAATTGGCA GAAGAAGGCA AAAATACTTG CTTGCGTTGG
CTTTTGGATG GATGGAGCGA TGAAATTTCT TCTCCTAATT ATTCAGAAAT ACAACAACTG
CTTAATCCTT CAACTGCCAT TGTTTATTGG CATCTCAGTT CTTACGCCTT ACATACTTTC
ATTCTTAAAC ATAACGCCCC GTCACCAATT GTTTTAGGCA ATACCGAGTG TCTAACTCAG
GCGCAACGTT TACGCGATTT TGAAGCCTGG GTGAAAAAAT GGAACGAACA ATACGCCAAT
TATCCCAAGG ATAAAGACAA GCAAGGCGAA AAAGATAGAA CATGGCGCGA CAATTTACCC
GAAATGCTGC GGAATCTGAG TCATATTCTC GATATAAATG CTGTCGTCTC CACAATTCCA
GATATTACTC AATTAATTCT CATTCCTCAC CGCGATTTGC ACCGCTTCCC TCTTCATGCA
CTGTTTCCAC CTGAATTTAC CATCAGCTAT TTACCCAGTG CAAAAATTGG CTCTATCTCT
GTTAAGAAAG ACAATTATAA CCAAAAAAAT TTACTTAGCA TCGAACATCC CAACAGTACG
GGTTATCCCT CATTAGATTT CGCCGAAATC GAATCAGAAG CCATCAGCCA AATGTTTGCC
AACCCTACAC GTCTGCATTC TGAACAAGCC ACACAAAAAG CACTGATAAA TGCTTTGCCC
CAAAGCTACA ATATTTTTCA CTTTACAGGA CATGGTGTAT ATAACTTCCA AAATCCGGCG
TTATCTTTTT TAGCATTAGC AGATGAAGAC AAGTTAACTC TAGCGGATAT CCACGGCTTT
AAATTGCAAA GCTATCAACT CGTCACCTTA GCAGCTTGCG AAACTGCAAT CACCGGAAAT
CACACCATCA CTACAGAATA TGTAGGGCTT GTCAGTGGCT TTATGGGTTG CGGTGTGGCT
CATGTCGTCA GTACCCTGTG GACTGTAGAA TCAGCCGCTA GTGCTTTGGT GATGATTCAG
TTTTACCAAC TGCTGCAACA AGGTAAACCA GAAACTATAG CTTTAGCTGA AGCTACCCAA
TGGTTGCGAA ATGTCACCAA TGCAGAACTA GCACAATGGT ATGCAGCGCA ACTTGCCAAA
GTTCCTGAAA ATCAAGGACT CCTTTACAAT TGCTGGTCAC GCCATTTAAA TAAACTTAAG
AATAACCCAG AACCTAGTAA ACAACCCTAT AATCACCCTT ACTTCTGGGC AGCTTTTACT
ATTACTGGCA ACTTTTCACA ATGA
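
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be normalized before any computation such as GC content. The following is a minimal sketch with hypothetical helper names. It assumes the input contains only A, C, G, T and whitespace.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_line = "ATGCTCAAGC AACTTTGGCA GTGGCTCAAG AGGTCTTTTG GGCGTTTATT TGGCAGAAAG"
print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(first_line))
```

Applied to the full sequence, `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.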
 
Protein sequence
MLKQLWQWLK RSFGRLFGRK HSPVREQNKV EPPQRLTDAE YESLFLQLLA EVNDGLTRGE 
AKGFLAAKHI NEGDLVEWLR GFGERLFASA KPNDELVSRM VRLGELSIGE VSDVAGDIGR
RLGGGETNRR GAEDAEEEET SNQFNDAIEL TSDEAEAWLN QGVALANLGQ LEQAITSFDK
AIEFKPDDDS AWYSRGVALC NLGRFEQAIA SYNRAIEFKH NFPEAWTNRG VILNSLKLYQ
EALTSFETAL QINPNFPEVF NAWYGRGNTL FNLEKFEEAI ASYDKAIEFK ADDYSAWYNR
GVALDNLGQF EEAIASYDKA IEFKADDYSA WNYRGVALAN LGRFEEAIAS YDKAIEFKAD
DYSAWYNRGV ALSNLGRFQE AITSYDKAIE FKADFYIAWM NRGIVAGNVI VERIDFSTFP
LPHAAAHNLA LKFNNPDLNK RGYEGRLASY EEGLKHCQQE THPEGWGKLH RAIGDSHYYQ
GRGNYNTRYF WRKAINSYKT ALQTLTATNF PELHLEVLQD LIRVLLDLGE IAEATELQRQ
GTELLRRLLN EPNRSERSKK QLALKFAWIQ QLTVDLAVQS GDLLQAIELA EEGKNTCLRW
LLDGWSDEIS SPNYSEIQQL LNPSTAIVYW HLSSYALHTF ILKHNAPSPI VLGNTECLTQ
AQRLRDFEAW VKKWNEQYAN YPKDKDKQGE KDRTWRDNLP EMLRNLSHIL DINAVVSTIP
DITQLILIPH RDLHRFPLHA LFPPEFTISY LPSAKIGSIS VKKDNYNQKN LLSIEHPNST
GYPSLDFAEI ESEAISQMFA NPTRLHSEQA TQKALINALP QSYNIFHFTG HGVYNFQNPA
LSFLALADED KLTLADIHGF KLQSYQLVTL AACETAITGN HTITTEYVGL VSGFMGCGVA
HVVSTLWTVE SAASALVMIQ FYQLLQQGKP ETIALAEATQ WLRNVTNAEL AQWYAAQLAK
VPENQGLLYN CWSRHLNKLK NNPEPSKQPY NHPYFWAAFT ITGNFSQ
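
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The sketch below builds the codon table in the usual TCAG ordering. It assumes that the sense-codon assignments of translation table 11 match the standard genetic code, and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The `translate` helper is illustrative, and it raises `KeyError` on any codon containing characters other than A, C, G, T.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Map each codon (TTT, TTC, TTA, ...) to its one-letter amino acid; '*' is stop.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCTCAAGC AACTTTGGCA GTGGCTCAAG"))  # MLKQLWQWLK
print(translate("ATTACTGGCA ACTTTTCACA ATGA"))        # ITGNFSQ
```

The two calls translate the first 30 bases and the last 24 bases of the gene sequence, and the results match the first and last residues of the protein sequence listed above.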