Gene Ava_4785 details

Gene Information

Locus tag: Ava_4785
Symbol:
ID: 3679438
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6016505
End bp: 6017626
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 45%
IMG OID: 637720141
Product: peptidase M50
Protein accession: YP_325277
Protein GI: 75910981
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
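
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end positions are inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 6016505
end_bp = 6017626
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1122 bp is 374 codons, one of which is the stop codon, leaving 373 residues.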


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.384856
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGGAA CAATTCGCGT TGGTAATCTC TTCGGTATTC CTTTTTATAT CCATCCGTCG 
TGGTTTTTAG TTCTGGGTTT AGTTACCTGG AGTTATGGCG GTGGACTCTC AGCAGAATTT
CCCCAACTAT CTGGGGTGAT GGCTTTGGGA CTGGGACTGA TAACGGCGTT GTTATTGTTT
GCTTCTGTCG TCGCTCATGA ATTAGGACAT AGCTTTGTCG CCATCCGTCA AGGAATTAAC
GTTAATTCCA TCACACTATT TATCTTTGGT GGCTTGGCTA GCTTAGAAAA AGAGTCCAAA
ACACCAGGTG GAGCCTTTTG GGTGGCGATC GCCGGGCCTC TAGTCAGTTT ATTATTGTGT
GGTATCGTCA CGGCAATTGG TGTGACTACG GCAGTTACAG GGCCATTGGC AGCAATTCTG
GGAGTTCTGG CTTCTGTAAA CTTAGCTTTG GCATTGTTTA ACCTGATTCC TGGCTTACCG
TTGGATGGTG GAAACGTCCT TAAAGCCATT GTTTGGAAAG TAACAGGTAA TCCCTATAAA
GGTGTCACTT TTGCTAGTCG TGTAGGACAA GTATTTGGTT GGGTGGCGAT CGCTTCTGGT
ATTTTCCCCA TACTATATTT TGGTAGCTTC GCCAACGTGT GGAATCTGTT AATTGGCTTC
TTCTTGCTAC AAAATGCTGG TAACGCAGCC CAATTTGCCA GAGTGCAAGA AAAACTCACA
GGCTTAACAG CAGCCGACGC TGTAACGACC GATAGCCCTA TAGTTTCTGC CCATCTTAGC
CTGAGAGAAT TTGCTGATGA TCAAATCGTT CAAGGACAGA ACTGGCGACG GTTTTTAGTT
ACCAACAACG CAGGACAATT GGTAGGTGCG ATCGCTCTTG ATGACTTGCG AAACATCCCC
ACTACATCCT GGACAGAAAC TCAAATTCAA CAGGTGATGC GGCCAATTCA ATCTACCACC
ATCAAATCTA GTCAACCATT GTTAGAAGTA GTGCAATTAC TAGAACAACA AAAATTGTCT
GCCCTCCCCG TAATTCTCGA CAATGGTGTA CTACTAGGCA TTTTAGAAAA AGCCGCTATC
ATCCAGCTAT TGCAAAACGG AACCCAACCT AGCCCTGCAT AG
 
Protein sequence
MNGTIRVGNL FGIPFYIHPS WFLVLGLVTW SYGGGLSAEF PQLSGVMALG LGLITALLLF 
ASVVAHELGH SFVAIRQGIN VNSITLFIFG GLASLEKESK TPGGAFWVAI AGPLVSLLLC
GIVTAIGVTT AVTGPLAAIL GVLASVNLAL ALFNLIPGLP LDGGNVLKAI VWKVTGNPYK
GVTFASRVGQ VFGWVAIASG IFPILYFGSF ANVWNLLIGF FLLQNAGNAA QFARVQEKLT
GLTAADAVTT DSPIVSAHLS LREFADDQIV QGQNWRRFLV TNNAGQLVGA IALDDLRNIP
TTSWTETQIQ QVMRPIQSTT IKSSQPLLEV VQLLEQQKLS ALPVILDNGV LLGILEKAAI
IQLLQNGTQP SPA
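
The gene and protein sequences above are related through translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the two tables differ only in which codons may act as start codons). The following is a minimal Python sketch of how the displayed sequence blocks could be cleaned, measured for GC content, and translated. The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers written for this illustration, and only the first line of the gene sequence is used as input.

```python
# Compact codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to display 10-letter groups."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGAATGGAA CAATTCGCGT TGGTAATCTC TTCGGTATTC CTTTTTATAT CCATCCGTCG"
dna = clean(first_line)
print(len(dna))        # 60
print(translate(dna))  # MNGTIRVGNLFGIPFYIHPS
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare with the 45% GC content reported in the record.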