Gene Ava_4824 details


Gene Information

Locus tag: Ava_4824
Symbol:
ID: 3679400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6060462
End bp: 6062138
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 44%
IMG OID: 637720181
Product: Beta-Ig-H3/fasciclin
Protein accession: YP_325316
Protein GI: 75911020
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2335] Secreted and surface protein containing fasciclin-like repeats
TIGRFAM ID:
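
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6060462
end_bp = 6062138
gene_length_bp = 1677
protein_length_aa = 558


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced, complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 6062138 - 6060462 + 1 = 1677 bp; 1677 / 3 = 559 codons; 559 - 1 stop = 558 aa.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the listed gene length and protein length are consistent with the listed coordinates.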


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.597464
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.196528
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAGTT CGGTTCGACG GTCACTAGCC TACACAACTT TGCTGGCTTT GGGGATGACA 
GCTATAACAA TAAATCCCTT AATAGTTTCT AAACCAGCTT CAGCACAAAC ACCTGTCCCT
ACAGAAACTC CGTCTACTAC AGGTTCCAAC TTTTCTGATG TCAGTTCAGA TTACTGGGCG
CAACCATTTA TTCAAGCTTT AGCGCAAAGA AATATCATTG CTGGTTTTCC CGATGGTACT
TTTAGACCAA ACCAGGCAGT AAGTCGCGCT GAGTTTGCCA CATTAATTCA GAAAGCTTTT
AATCAACAAC CGGTCCGACA ATTAAGTGCA TCTGGATTTA CAGATGTACC TGCAAATTTC
TGGGCATCGC AAGCAATTCG GGAAGCTTAC GAAACGGGAT TTCTCTCCGG CTATCCAGGG
AATGTGTTTC GCCCCAATCA ACAGATTCCT AGAGTACAGG CGATCGTTGC TTTAAGCAGT
GGTTTAAACT TAACTACAAC TGATACTGCG TCAAATATTC TCAGCAATAA CTATGCAGAT
GCTTCGGCAA TTCCTGACTA TGCTGTCAAC GGCGTAGCCG CAGCAACACA AAGCAACATA
GTTGTTAACT ACCCAAATGT AAGAGAACTG AATCCCTCAA CATCTCTTAC CCGTGGGGAA
GCTGCCGCAA TTTTGTATCA AGCTTTAGTT CGACAAGGAC AAGTACAACC TCTACCTAGC
AATGTTGCAG CTGCTAACTA CGTAGTGGGT GGGACTGGTA CAACAGGAGG TACACAAGGT
GCTAATAATA TTGTTGCTTT GGCAGCATCA AGTAACTCTT TTAGTACCTT GACTTCTTTA
TTGAGAACCG CAGGTTTAAC AGATATTCTA GAGCAACCAG GGCCTTACAC AGTCTTTGCT
CCCACCAATG AAGCATTTGC AGCGTTACCT GCGGGTACTT TAGAACAACT GCAACAACCA
CAGAACAGAG AGTTGTTGGT GAGAATTTTG CGCTATCATG TGGTTCCTGG TCAATTAACT
GCTAACCAAC TCTCTTCTGG ACAACTGACA ACTGCTAGCG ATGCACCAGT CAATGTGAGA
GTTGACACAG CCAATAATCA AATTGCCGTT AATGAGGCGA GAGTTGTTCA AGCAAATATT
CAAGCTAGCA ATGGTGTTAT CCATGCTATT AACGAAGTCC TGATTCCACC TAATTTAACT
GGTCAGCAGC CGCAAGAAGG AACCCCTCAA GCACAAAATC CGGGTGCTGT CACTCCAGGT
AGAGCTACCC GTGGCGGTTC TAGTTACATA GGGGTTGCTG GTAACATTGG TTTAGGTGGT
GATACAGCTC TCAGCGATAG CAACTTTGCA GTTATCAGTA AAGTTGGTTT GACACGCAAT
CTATCAGTCC GACCATCAGC TGTTTTTGGT AACGATACGG TATTTCTAGT GCCGTTGACC
TTGGATTTCA CACCCCGCGC AGTAGAGCCT GGTGTTGTGC AGCCATTCGC CGTATCACCT
TATGTTGGTG CTGGTGTAGC AATCGAAGCT AGTGGCGACA CTGATATTGG TTTACTGTTA
ACTGGTGGTG TTGATATTCC TTTAGGACAG AGATTTACCA TTAATGGTGC TGTTAATGCA
GCTTTTGTAG ATGAAACTGA TGTTGGTTTG CTATTAGGTA TTGGCTACAA TTTTTAG
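
The GC content field in the record (44%) can in principle be recomputed from the gene sequence above. The following is a minimal sketch; `gc_content` is a hypothetical helper, not a tool used by the database, and it simply ignores the spaces and line breaks that separate the 10-base blocks.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input: 6 of the 8 bases are G or C.
print(gc_content("ATGC GGCC"))
```

Pasting the full gene sequence into the call in place of the toy string gives a value that can be compared against the rounded percentage listed in the record.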
 
Protein sequence
MFSSVRRSLA YTTLLALGMT AITINPLIVS KPASAQTPVP TETPSTTGSN FSDVSSDYWA 
QPFIQALAQR NIIAGFPDGT FRPNQAVSRA EFATLIQKAF NQQPVRQLSA SGFTDVPANF
WASQAIREAY ETGFLSGYPG NVFRPNQQIP RVQAIVALSS GLNLTTTDTA SNILSNNYAD
ASAIPDYAVN GVAAATQSNI VVNYPNVREL NPSTSLTRGE AAAILYQALV RQGQVQPLPS
NVAAANYVVG GTGTTGGTQG ANNIVALAAS SNSFSTLTSL LRTAGLTDIL EQPGPYTVFA
PTNEAFAALP AGTLEQLQQP QNRELLVRIL RYHVVPGQLT ANQLSSGQLT TASDAPVNVR
VDTANNQIAV NEARVVQANI QASNGVIHAI NEVLIPPNLT GQQPQEGTPQ AQNPGAVTPG
RATRGGSSYI GVAGNIGLGG DTALSDSNFA VISKVGLTRN LSVRPSAVFG NDTVFLVPLT
LDFTPRAVEP GVVQPFAVSP YVGAGVAIEA SGDTDIGLLL TGGVDIPLGQ RFTINGAVNA
AFVDETDVGL LLGIGYNF
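
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits; this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Codon table in the compact NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # "X" marks an unrecognised codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTTTAGTT CGGTTCGACG GTCACTAGCC"))
```

For the first 30 bases this prints MFSSVRRSLA, which matches the first block of the protein sequence above.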