Gene Ava_D0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_D0004 
Symbol 
ID8952391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_014000 
Strand
Start bp1679 
End bp3643 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content38% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003541127 
Protein GI292905256 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAA GGAAAAATAA TGGTGGTATT CCTAGATTTT TTGACCTACC ACAAAGTAGC 
AAAGTATCTT TTAAGGTTCC CGGTAAAAAA TCAAATCGTC GCAGATTACC CGATGGCAAT
TATCTTGATG ACGATGCGAT CGCACCACGA CTAGATTTTA TAGAAGATTT TGAACTTACT
GTTACTGGAG CGCCGAGAGT CTTTAGTCCA GAGCAAGAAT ATAATTATAA ATCTGCGTGG
AAGAGATACC AAAAAGGTAT GGAGATAGCC CAAGAAGATA AAGTATTGAA AGACGGTGAT
GATAAATATA AAATACCTTC CCCGAATGCA CCCAAGGGAT ACTGGGAAGT CAGCAAGAAA
CCTCAAAATA ATATTACTTG TGAATTGCCT TTGCTTTTAA ATCTTCCTAA ATGGTTTGTT
GGGGATATAA GATTTGATGG AGATTTTGAA ACTATAAACA CTCCAGCAAA TGTAGTAGCT
CAAAGTTATG ATAATCAGAC AGGATTGTAT ACAGCCACTA TTATTGAAAC TATTAGTATT
ACAGCCGATA GTGAAGGTAT AGTTAATAAC GATGCTTTGA TAATTATTAA TTCAACAGGT
GGCGCTTTTA ATGATGGTTC ACCTGTAAAA ATCACCTCAA AGGTTATTAC AACTCATCCC
GGTGGTGGCA TTAGTATTGG CAATTCTATA TACAAAGGAA CACTTGTACC ACCTTTTGGC
GGTAGAGTTC CAGACCCGGA AAGTGGATTT GAATTAAAAG AATGGGTGTT TGATACCTTT
AGCAATGAAG AACAAACAGA GTTAGAATTT ATAAATAATT CCGAATTTAC TACTGGAACA
AACAGAGAAG ATTACGTAGG CTTTACATTA GAAACTACTG TTGCTGGTGG TTTTGCCTCT
GAAACATTTC CAGAATGGGA AGTAACTTTT GAAGTAACAG TAGAAATTTA TCTATTTCCT
GGTGGATTAG AGTTTACATT TGAACCATAC TTTGATGATG AGCTACTTAA TAACTATCGA
CCAGGAATAG TTTTTAATGA AGAAACTGGA GAAATTGAAT ATTTTCCGAA CGGAGAAAAC
ACGGCAACAG GTGTAATAGA TGGTGCAAAT GTTAGCTCTA TTTATACTTG GCAATCTAAT
TTTGATAAAA GCAATATAGT AACTAGTAGT GTAAAAATAG AAAAAACTTG CAATCAAGCA
ACCAATGTAT CTGGTTTTAC TCTTAGGTCT AGTGGGGCTA TAGATTTTAA TTCTGAACTT
AATATTAATA AAATAAGTGG CGTTCTTTCT TGGACAATAT CACTAAGCTA TAGCGGTGCA
GAAATATATC CAAATGACCC TATTGATGCA CTCTTTACTC AACAATATGT AGAATTTTCT
GGTGAGAATT TTACAGAAGA TGACAAAGAA TCTTATAGCA ATGGAAGTCT TGTTTCATTG
GAACAAAACA ATTTAACTTA TCAACATGAA TTTAGTGTAA ATTTTAAATA CCTAGAAGGA
GGGGGGCAGT TTTCAGCATT CTTAACTCAG CAAGTAAGAA TATTTGAAAT TGAATTTGCT
GATGGTAGTA AATGGGAATG TGGTAATGCT GACGGTGACG GTGACGGTGA CGGTGACGGC
GACGGTGATA GTGACGGTGA TGGCGACGGT GATTGGCAAT GCAACTGCCC TGACGCAACC
AAGAAAAAAT CCCCCAACCC CAACAGCACA TCAGATGACG GAATAGAAGA AGACTGGAGT
AATACCGATG CTGGCGCACA AGATGGGCAG TGTAAGCACA TCTGGGCTGT GAGGATCGTG
CTGAATGATG TAGCGCCAGA GGAAATTCCT ACAGATATGC CCATTAGTGA AAATTTCCCA
AATGGTAACG GGAACAGTAC TAACCCCAAA GTCATTGGGG GAAATGGTGG AATAGGATTT
GATGGATGGA ATCCAGGCAA AAGAAGGAGG GGGAAGAATG CCTAA
 
Protein sequence
MGKRKNNGGI PRFFDLPQSS KVSFKVPGKK SNRRRLPDGN YLDDDAIAPR LDFIEDFELT 
VTGAPRVFSP EQEYNYKSAW KRYQKGMEIA QEDKVLKDGD DKYKIPSPNA PKGYWEVSKK
PQNNITCELP LLLNLPKWFV GDIRFDGDFE TINTPANVVA QSYDNQTGLY TATIIETISI
TADSEGIVNN DALIIINSTG GAFNDGSPVK ITSKVITTHP GGGISIGNSI YKGTLVPPFG
GRVPDPESGF ELKEWVFDTF SNEEQTELEF INNSEFTTGT NREDYVGFTL ETTVAGGFAS
ETFPEWEVTF EVTVEIYLFP GGLEFTFEPY FDDELLNNYR PGIVFNEETG EIEYFPNGEN
TATGVIDGAN VSSIYTWQSN FDKSNIVTSS VKIEKTCNQA TNVSGFTLRS SGAIDFNSEL
NINKISGVLS WTISLSYSGA EIYPNDPIDA LFTQQYVEFS GENFTEDDKE SYSNGSLVSL
EQNNLTYQHE FSVNFKYLEG GGQFSAFLTQ QVRIFEIEFA DGSKWECGNA DGDGDGDGDG
DGDSDGDGDG DWQCNCPDAT KKKSPNPNST SDDGIEEDWS NTDAGAQDGQ CKHIWAVRIV
LNDVAPEEIP TDMPISENFP NGNGNSTNPK VIGGNGGIGF DGWNPGKRRR GKNA