Gene Ava_4059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4059 
Symbol 
ID3681680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5044434 
End bp5046179 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content45% 
IMG OID637719410 
ProductS-layer region-like 
Protein accessionYP_324558 
Protein GI75910262 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value8.41951e-11 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.105376 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAACA TCGTATGTAA GTCTTTGTTT ATAGCTCCCG TACTGCTGGG AGTGAATGTG 
TTAATGGCTC AAGTTGTATC AGCATCTCCG GTAGCAGGAG ACGTTGAAAC ACAACAAGAA
GCGGATGATA ACTCATCAAT CACTCAGGTG ACATCTGTTT CGCAACTATC TGATGTGCAA
CCGACAGACT GGGCTTTTCA AGCTTTACAA TCTTTAGTTG AGCGTTATGG TTGTATTGCT
GGATATCCCG ACAGCAGCTT TCGGGGTAAT CGCGCCATAA CTCGCTATGA GTTTGCTGCT
GGTTTGAATG CTTGCTTGAA TCGGATTAAC GAACTGATTG GGACAAGTAC GGCGGATTTA
GTCACAAAAG AAGATTTAGC CACACTACAA AGACTGCAAG AAGAGTTTGC CGAAGGACTA
GCAGAACTGC GGGGAAGGGT AGATGGACTA GAAGGGCGTA CAGCCATCTT AGAAAGTCAA
CAATTTTCTA CCACTACCAA GTTGAATGCA GAGGTAATTC TTACTGTTGG TAGTGTGTTT
GGTGGAGATA GAGCGTTAAA CTCTGACCAA TGGCGAACCA TAAATGCACA GCCTGCTGGT
ACTGCACGGG AGAATGCTAA AAATAGTGCC TATGGAGCCG CAGGAAGGAA TCTGCAAGAT
AATGCTATTT TAAGCGATCG CGTCCGCTTG AATTTAGTCT CTAGTTTCAC TGGCAAAGAC
CGCTTGTTTG CTCGGCTAGA AGCAAACAAC ACTACCGCAT TTAATGCTCC AGTGACAGGC
ACTAATATGA GTCGTCTGGG GTGGGATGCA ACTGGTAACT TAGATAACAG TGTTCAATTA
GGCAAGCTAT TTTACCGCTT TCCTGTAGGT GATAAACTCA ACATCATTAT TGATGCCATT
GGTGGAGAAT TTTACGATAA CTTCAATACG ATCAATCCCC TGTTAGCATC AGCTCCTACC
GGTGCAGTAT CCCGTTTTGG TCGCTTCTCG CCTATTTACC GAGCGAGTAA CACTGGTTCT
GCTAGCAATA CCGGTTCAGG AATTAGCGCC ATATTTCAAT TGAGTAATGC TGTGACTTTC
TCGGCGGGAT ATCTAGCCAG ACGAGGCAGT GAGCCGACAC CCGGTAGAGG TTTATTTGAT
GGTAGTTATG GAGCCTTGGC GCAGTTAGAA TTTCAACCAA ACACAAACTT AACCCTGGGT
TTGAGCTATG CTCACTCCTA CTTTAGTGGT GAAGCCGGGG ATGTTACCGT TTCCGGCGCT
TATGGTAGTG CTTTCGCTAA TACCCCGTTT GGTTCTGCGG TGTCTACTTC CGCCAATTAT
TATGGTTTAC AAACGAGTTA CCGTTTCAAT CCCCAGTTTA TAGTCTCTGG ATGGGTCGGA
TATACCCAAG CTATAGCCGA AGCCAGTTCC GGTACTACAG TCAACCGTGG TGATCAAGCA
GATATTTGGA ACTGGGCAGT AACTTTAGCT TTCCCCGATT TGGGGAAAAA GGGGAATTTG
GGTGGTTTAA TTTTTGGACA ACCGCCAAGA GTTACCAGTA ATGACTTTGG CCCCGCTAAT
TTGACTGCTA CCAGCGCTCG GCGTGAAGAT ACAGATACAT CTTTCCATGT GGAGGCTTTG
TATCGCTACC AACTGAACAA TAATATCTCG ATTACACCAG GGTTAATTGT CATATTTAAC
CCAGAAAATA ACAGCAACAA CGATACTATC TATACCGGTG TTGTTCGCAC TACGTTTAGA
TTTTAG
 
Protein sequence
MFNIVCKSLF IAPVLLGVNV LMAQVVSASP VAGDVETQQE ADDNSSITQV TSVSQLSDVQ 
PTDWAFQALQ SLVERYGCIA GYPDSSFRGN RAITRYEFAA GLNACLNRIN ELIGTSTADL
VTKEDLATLQ RLQEEFAEGL AELRGRVDGL EGRTAILESQ QFSTTTKLNA EVILTVGSVF
GGDRALNSDQ WRTINAQPAG TARENAKNSA YGAAGRNLQD NAILSDRVRL NLVSSFTGKD
RLFARLEANN TTAFNAPVTG TNMSRLGWDA TGNLDNSVQL GKLFYRFPVG DKLNIIIDAI
GGEFYDNFNT INPLLASAPT GAVSRFGRFS PIYRASNTGS ASNTGSGISA IFQLSNAVTF
SAGYLARRGS EPTPGRGLFD GSYGALAQLE FQPNTNLTLG LSYAHSYFSG EAGDVTVSGA
YGSAFANTPF GSAVSTSANY YGLQTSYRFN PQFIVSGWVG YTQAIAEASS GTTVNRGDQA
DIWNWAVTLA FPDLGKKGNL GGLIFGQPPR VTSNDFGPAN LTATSARRED TDTSFHVEAL
YRYQLNNNIS ITPGLIVIFN PENNSNNDTI YTGVVRTTFR F