Gene Ava_3476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3476 
Symbol 
ID3679788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4309660 
End bp4310985 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content43% 
IMG OID637718828 
Producthypothetical protein 
Protein accessionYP_323978 
Protein GI75909682 
COG category[S] Function unknown 
COG ID[COG3395] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.380975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.709145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACA AACCAAAAAT AATTGTCTTG GATGATGACC CTACAGGTTC TCAAACAGTC 
CATAGCTGTT TGCTGTTGAT GCGTTGGGAT GTGGAGACTT TACGCACAGG CTTGCGGGAT
GATGCACCGA TTTTTTTTAT CCTCACCAAT ACTAGAGCCT TACCGCCAGA ATCAGCCGCA
TCGGTGACAA GAGAAGTTTG TCAAAACTTG AAGGTAGCGC TGGCTGCTGA AGCCGTGAAT
GACTTTCTCA TTGTCAGCCG TTCTGATTCT ACCTTGCGCG GACATTATCC CATTGAAACC
GATGCGATCG CTCAAGAACT AGGATCATTT GATGCTCATT TTCTTGTCCC TGCTTTTTTT
GAAGGGGGAC GCATTACCCG TGACAGCATA CATTACCTAA CTATTGATGG TGTACCTACC
CCAGTTCACG AAACTGAATT TGCTCGTGAT TCTGTATTCG CCTACCATCA CAGCTACTTA
CCTAAGTATG TGGAAGAAAA AACTCAAGGC GGTATTAATG CGGAGTCCGT AGAACGATTC
TTACTAAGTG ACATTCGTAC TGGAAGCTTA GAACGCTTAC TCAAGCTTAC AGATAATCAG
TGCGCTGTGG TGGATGGAGA AACTCAAGCG GATCTCAACC GTTTTGCTGT AGATGTATTA
GCAGCAGCTA GTCAGGGGAA ACGTTTTTTA TTCCGCAGTG CTGCCAGTAT TTTAACCGCC
TTGGCTGCCT TACCACCCCA ACCCATCGCT GCCGAAAATA TGGCTGAGTA TGTGCGAAAA
GGTAAACCAG GAGCCGTAAT AGTCGGTTCT CATGTCAAAA AGACAACGCA ACAGCTAGAA
GCGCTGTTAC AAGTTGCGGG AACAGTGGGA ATTGAAGTCA ATGTGTCACG ATTACTTGAT
GATCAGGTAG ATGCAGCTGA TATACTGTTA TCTCAAATTA AAACAAGTGT CGAGGAAGTA
CACGAATCTG GTAAAACACC GGTAGTTTAT ACGAGTCGTC AAGAACTCAC ATTTAAGGAT
GTTAAAACCC GGTTGGATTT TGGCATCAAA GTATCAAGTT TATTAATGGA TATTGTCCGC
AATTTACCCC CTGACATTGG ATTTCTCATC AGTAAGGGGG GAATTACTTC TAATGATGTA
TTAAGTACTG GATTAGCTTT AACTTCTGCT CGCTTACTTG GTCAAATTTT ACCTGGTTGT
TCGATGGTGC TAACATCGTC TAACCATCCT CAATTTCCCG ATTTACCAGT AGTGTTATTT
CCAGGGAACG TTGGTGATAC TAACGCCTTG GGAAAAATTT ATCAAAGATT AACTAAAAAT
ACTTAA
 
Protein sequence
MSNKPKIIVL DDDPTGSQTV HSCLLLMRWD VETLRTGLRD DAPIFFILTN TRALPPESAA 
SVTREVCQNL KVALAAEAVN DFLIVSRSDS TLRGHYPIET DAIAQELGSF DAHFLVPAFF
EGGRITRDSI HYLTIDGVPT PVHETEFARD SVFAYHHSYL PKYVEEKTQG GINAESVERF
LLSDIRTGSL ERLLKLTDNQ CAVVDGETQA DLNRFAVDVL AAASQGKRFL FRSAASILTA
LAALPPQPIA AENMAEYVRK GKPGAVIVGS HVKKTTQQLE ALLQVAGTVG IEVNVSRLLD
DQVDAADILL SQIKTSVEEV HESGKTPVVY TSRQELTFKD VKTRLDFGIK VSSLLMDIVR
NLPPDIGFLI SKGGITSNDV LSTGLALTSA RLLGQILPGC SMVLTSSNHP QFPDLPVVLF
PGNVGDTNAL GKIYQRLTKN T