Gene Ava_1981 details


Gene Information

Locus tag: Ava_1981
Symbol:
ID: 3681544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 2460693
End bp: 2462099
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 40%
IMG OID: 637717322
Product: cytochrome P450
Protein accession: YP_322498
Protein GI: 75908202
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
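
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2460693
END_BP = 2462099


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1407
print(protein_length(length_bp))  # 468
```

Both results agree with the Gene Length and Protein Length fields listed in the record.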


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0150704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.856547
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTATC AAATAAAGAG ACCTAATCCG CTAAAAACTC ACCCTTTTCT ACAAAAACTC 
CAATGGATTG CTGACCCTGT AGAATACATG GAAAAAGCAT CTCTGCAACA TCGTGATATG
TTTACAGCAG AGGTAATTGG TTTTGGCGAC ACTGTAGTGT TTGTTAGCCA CCCTCAGGGA
ATTCAGACAA TCTTTGCCAA TGACAGAAAA AAGTTAGTAG CTGTTGGCGA AGCAAACAGA
ATTTTGTATC CTTTAGTGGG TAACAATTCT ATGTTCCTAC TAGAAGGAGT AAAACACAAG
CAACGACGAC AACTCCTTAT GCCCTCCTTT CATGGGGAAA GAATGCGCGA GTATGGCCAC
TTAATCAGAA ATATTACTGA AACCCTTTTT AGTCAGCTAC AACAGAATGT TACTTTTTCT
GCTCTCACAG CAATGCGAGA AATCTCGATG CAGGTTATTT TACAAGCTGT GTTTGGTTTT
TATGAGGGAG AACGTTGTCA ACAATTCAAA CACTTATTAC CAGTTTTTCT TTCTGAATTA
TTTCAGTCCC CATTGGCTTC TAGTATCCTC TTTTTTCCTT TCCTCCAAAA AGATTTAGGT
AATTTGACTC CCTGGGGAAG ATTTGTCCGC CAAAGAGAGA AAATTGATAA ACTGCTTTAT
GAAGAAATTG CTGAACGCAG ACAGGAAATA AATTCTGATC GTATTGATAT TCTTTCCTTG
CTCATATCGT CTAGAGATGA GACAGGTAAT TCCATGTCAG ATCAAGAGTT GCGGGATGAA
TTGATCACCT TGATGATTTC TGGACATGAA ACTACAGGGA CAGCTATGGC ATGGTCTTTA
TATTGGATTC TGCAAACTCC AGAAGTATTT CAAAGGCTGA TCCAAGAGTT AGATAGCCTC
GGTGATTCTC CTGATCCTAT GAGTATCTTT CGGTTGCCAT ATCTTACAGC TGTCTGTAAC
GAGACCTTGC GAATTAACCC TGTTGCCATG TTAACACTAC CTAGAGTAGT GAAAGAACCA
GTTGAGCTAC TGGGAAATCG ACTAGAGAGT GGTACAACAG TAGTTGGCTG CATTTATCTG
ACTCATCACC GAGAAGATTT ATATCCCGAA TCAAAGCTAT TTCAACCAGA GCGGTTTCTC
AAACGTGAAT TTTCCCAGTA TGAATTTATG CCATTTGGTG GTGGTGTACG TGGTTGTATT
GGTCAGGCTA TAGCTATGTT TGAAATGAAG ATAGTATTAG CAACAGTCCT ATCACGTTAT
CAATTTGCAC TGGCAGATGG AAAACCAGAA CGTCCTCAGC GTCAAGGTTT TACTCTTACA
CCTGCCAACG GAGTTAAGAT GTTAATCACA GGAAAACATC AGCGTCAAAA CTATTCAACT
GCTGCCTCAA CAACATTCAC AACATAG
 
Protein sequence
MKYQIKRPNP LKTHPFLQKL QWIADPVEYM EKASLQHRDM FTAEVIGFGD TVVFVSHPQG 
IQTIFANDRK KLVAVGEANR ILYPLVGNNS MFLLEGVKHK QRRQLLMPSF HGERMREYGH
LIRNITETLF SQLQQNVTFS ALTAMREISM QVILQAVFGF YEGERCQQFK HLLPVFLSEL
FQSPLASSIL FFPFLQKDLG NLTPWGRFVR QREKIDKLLY EEIAERRQEI NSDRIDILSL
LISSRDETGN SMSDQELRDE LITLMISGHE TTGTAMAWSL YWILQTPEVF QRLIQELDSL
GDSPDPMSIF RLPYLTAVCN ETLRINPVAM LTLPRVVKEP VELLGNRLES GTTVVGCIYL
THHREDLYPE SKLFQPERFL KREFSQYEFM PFGGGVRGCI GQAIAMFEMK IVLATVLSRY
QFALADGKPE RPQRQGFTLT PANGVKMLIT GKHQRQNYST AASTTFTT
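
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon table in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from this record.
print(translate("ATGAAGTATC AAATAAAGAG"))  # MKYQIK
```

The six residues printed match the start of the protein sequence listed above. Passing the full text under "Gene sequence" to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.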