Gene Ava_4857 details


Gene Information

Locus tag: Ava_4857
Symbol:
ID: 3679277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6120266
End bp: 6121852
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 40%
IMG OID: 637720214
Product: phospholipase D/transphosphatidylase
Protein accession: YP_325349
Protein GI: 75911053
COG category: [I] Lipid transport and metabolism
              [L] Replication, recombination and repair
COG ID: [COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes
        [COG1555] DNA uptake protein and related DNA-binding proteins
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
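
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 6120266
END_BP = 6121852


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp, protein_length_aa(length_bp))  # prints "1587 528"
```

Under these assumptions the computed values agree with the listed Gene Length (1587 bp) and Protein Length (528 aa).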


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.404738
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000312959
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGATAGTGG CGATCGCCGC CTGTCAAAAA GTCCAATCTC ACAATAATCG TCCTGCACCT 
CTACCGCAAG ACTCATTTGT GAAAGTTTAC TTTAATCAAT CCGAATCCTC AGAATATCGA
GAACCTTACC GTCAACAAAC TCGACTGGGA GATAACTTAG AACAGCAGAT TATTGACGCT
ATTTCTCAAG CTAAATCTAC TATCGATGTA GCAGTACAAG AATTGCGTTT ACCGAGAATC
GCCCAAGCCC TCAAAGACAA ACAAAAAGCG GGAATCAAAG TCAGAGTAAT TTTAGAAAAT
ACCTATACTC GTTCTTTGAG TAACTTGACA CCAGATGAAG TCAAGAAATT ACCTGAACGG
GAACAAGCAC GCTATCAAGA ATACTTTAAA TTTGTAGACC TAAACCAAGA TAATCAACTC
AGTCCTGAGG AAGTTAATCA GAGGGATGCA CTGATAATTT TACAAAATGC CAAAATTCCT
TGGATAGATG ATCAAGCTGA TGGTTCAGCA GGTAGTAAGT TGATGCACCA TAAGTTTGTG
GTTGTAGATA ATCGCATAGT AATTGTGACT TCGGCAAACT TCACCTTAAG CGACGTTTTC
GGGGATTTCT CTAATTCTTC AAGTTTGGGA AATGCCAACA ACCTATTACA CATTGATAGC
CCAGAATTAG CAGCTTTGGT CACAGAAGAA TTCAACCTCA TGTGGGGTGA TGGTGTTGGA
GGTAAACCAG ACAGTAAATT CGGTTTAAAT AAACCTGTAC GTCCTCCCCA AAAAATTACC
TTGGGTGACA ACACAATTAC TGTGCATTTT TCCCCAACTT CACCCACCTT ACCTTGGACT
CAAAGCAGCA ATGGCTTAAT TAATGAAAGC TTAAATTTAG CGAATAAATC TATTGATATG
GCGTTGTTTG TTTTTTCCGA ACAGCGTCTT GCTAATACAT TAGAAAAACG TCATCAACAA
CAAGTCTCAA TTCGAGCATT AATTGATAAA CAATTCGCCT ATCGTTATTA CAGCGAAGCT
TTAGATATGA TGGGAATTGC CCTGGGTAAT AAATGCCGAT ATGAAATTGA TAATCGACCT
TGGTCTAATC CCGTTACTAC GGTGGGCGTA CCCACTTTAC GAGAAGGAGA CCTGCTACAC
CATAAATTTT CTGTTATCGA CAACCAAACG GTAATTACAG GTTCTCACAA CTGGTCTGAT
GCAGCAAATC ATGGCAATGA TGAGACTTTG ATAGTAATTA ATAATCCCAC AATTGCTGCT
CATTATGAGC GTGAATTTGC TCGTCTTTAC GCTAAAGCTC AAGTCGGTGT CCCAGCCAAA
GTCCAAGCAC AAATTCAACA AGAACAAAAG CAATGTGGTC AAATTAAAAC TCCTACTTCC
AGTGAACTTA CTCCTACTCA AGTGGTGAAT ATCAATACAG CAAATTTGGC AGAATTGGAG
ACCTTACCCG GTGTAGGTAA AAAGCTAGCC CAAAAAATTA TCACCGCCCG TCAGCAGAGA
AAATTTGTCT CATCACAAGA CTTGGATAAA GTACCTGGAA TCAGTCCAAA GATGATAGAA
AATTGGCAAG GGCGTATTCA ATTTTAG
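
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation such as GC content. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_fraction` are hypothetical helper names, and how the database itself computes or rounds the listed GC content is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence, as an example input.
first_line = "TTGATAGTGG CGATCGCCGC CTGTCAAAAA GTCCAATCTC ACAATAATCG TCCTGCACCT "
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
print(round(gc_fraction(seq), 3))
```

Applying the same two functions to all lines of the gene sequence gives a value that can be compared with the listed GC content.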
 
Protein sequence
MIVAIAACQK VQSHNNRPAP LPQDSFVKVY FNQSESSEYR EPYRQQTRLG DNLEQQIIDA 
ISQAKSTIDV AVQELRLPRI AQALKDKQKA GIKVRVILEN TYTRSLSNLT PDEVKKLPER
EQARYQEYFK FVDLNQDNQL SPEEVNQRDA LIILQNAKIP WIDDQADGSA GSKLMHHKFV
VVDNRIVIVT SANFTLSDVF GDFSNSSSLG NANNLLHIDS PELAALVTEE FNLMWGDGVG
GKPDSKFGLN KPVRPPQKIT LGDNTITVHF SPTSPTLPWT QSSNGLINES LNLANKSIDM
ALFVFSEQRL ANTLEKRHQQ QVSIRALIDK QFAYRYYSEA LDMMGIALGN KCRYEIDNRP
WSNPVTTVGV PTLREGDLLH HKFSVIDNQT VITGSHNWSD AANHGNDETL IVINNPTIAA
HYEREFARLY AKAQVGVPAK VQAQIQQEQK QCGQIKTPTS SELTPTQVVN INTANLAELE
TLPGVGKKLA QKIITARQQR KFVSSQDLDK VPGISPKMIE NWQGRIQF
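
The gene begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG can act as a start codon and is then read as methionine. The sketch below illustrates this for the first ten codons only. The codon table is deliberately partial (just the codons that occur in this prefix), the start-codon set lists only three common table 11 initiators, and `translate_prefix` is a hypothetical name.

```python
# Partial codon table: only the codons present in the first 30 bases of this gene.
CODON_TO_AA = {
    "TTG": "L", "ATA": "I", "GTG": "V", "GCG": "A", "ATC": "I",
    "GCC": "A", "TGT": "C", "CAA": "Q", "AAA": "K",
}

# Subset of start codons for translation table 11 (assumed sufficient for this example).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix, reading a valid first codon as methionine."""
    codons = [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is translated as Met
        else:
            residues.append(CODON_TO_AA[codon])
    return "".join(residues)


# First three ten-base blocks of the gene sequence, joined.
print(translate_prefix("TTGATAGTGGCGATCGCCGCCTGTCAAAAA"))  # prints "MIVAIAACQK"
```

The result matches the first ten residues of the listed protein sequence. A complete translation would need the full table 11 codon table rather than this partial one.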