Gene Acid345_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3843 
Symbol 
ID4070995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4551902 
End bp4553320 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content60% 
IMG OID637985867 
Productdiguanylate cyclase 
Protein accessionYP_592917 
Protein GI94970869 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAG AGCGCGCACG TCAAATTCTG TACAGTGCGG CGGTCGCGAT TTTCGTGTTC 
GCGAGCGTAT GGGTAGGACT CACGCTGACC CGGGGAGAAG ACAGAGTTGC GGCGATTTGG
CTGTCGAATG GCGTGCTGGT GGTGCTGTTG CTCAGCAAAG GCAAGACGCC GACGCTGCCC
CTGGTAATGA TTGGTTTTCT CGCCAATCTG CTGGCGAACC GGGTGGCAGG GGACCAGTGG
TTGCAGGCAT CGTCGCTTGC CGCCATCAAC GCGCTCGAGG TAGTGATTGC CTATGCGTTG
TTGACGCGGC GTGGACGCGA AGTAGACCTG CGAGACCCGA GTGGATTGGG CCGCTTCGTT
CTGTGGGGCG CGATCGTCGC GCCACTCATC TGTTCACTGC TCGGCACTAC CGTGGTGGTG
GCGTCGAGAG GCGGCGACTT CCAGGCCACT TTGCGGGTTT GGTTCCTAGC CGATGCGCTT
GGGATGGCGG TCATCGCGCC AGTGCTCTTC GTTTTTCTCA AAGACCATGT GTTGACTGAA
TTGTTTGGCC GGGAGAAACT CGGCGGTACC GCGTTAATGC TGTTGCTGTT GGGACTCAGC
CTGGCAGTGG TGTTTTCGCA GTCGAGATTG CCCCTCCTGT TCCTGGTTTT TCCCGTCCTG
GTGCTGGCCG TGTTCAAGCG TGGGTTTGCC GGCGCAGCCC TGGCCGTAGT GACGATCGCG
GCGATTGGAG TCATCGGCGC GGTGAAGGGC TTCGGGCCGA TCACGATCGG GTCGGGGAAT
GCATTCACGG ACCGGGTACT GTTGTTGCAG GCATATCTTG CATGCGTGGT GGCGACATCC
TTTCCGCTGG CGGCGGTGTT GGGAGAGCGA GACCGGTTAC ACGAGCGGCT AACAGGATTG
GTATATGTGG ATTGGCTGAC TGACTTGCCA AACCGCCGTC ACTTCGACAT GCGATTCAAC
AGCGAGTGGC GGCGCGCAAT GCGGTCGCGA ATGCCAATCT CGCTGATGAT GATCGACGTT
GACCAGTTCA AAGAATACAA CGACGCTTAT GGGCACATGG CGGGAGACCG ATGCCTGGCC
AAACTCGGGT CGGTGATGGC GGCCTCGGTG AAGCGATCGG CGGATTTCGT GGCGCGTTAC
GGCGGCGAAG AGTTCGTGGT AATTCTGCCG GAGACGACGG CAACGGGGGC GGGAATCGTA
GCTGCCAATG TGATGGAGGC AGTGGCAGCC CTGGAGCTGC CGCACGCAGG GAGTCCACAC
AAGCAAGTGA GTGTGAGCGT GGGAATCGCC TATGCGCATC CGGACATAGG GGCGGAGCCG
TCAGAACTGG TGCGCTCGGC GGATCGTGCG CTGTACACGG CGAAGCGTGA TGGACGCAAC
TGCGTCCGAG TGGCACTCGA ACAAACGACA AGTGCGTGA
 
Protein sequence
MASERARQIL YSAAVAIFVF ASVWVGLTLT RGEDRVAAIW LSNGVLVVLL LSKGKTPTLP 
LVMIGFLANL LANRVAGDQW LQASSLAAIN ALEVVIAYAL LTRRGREVDL RDPSGLGRFV
LWGAIVAPLI CSLLGTTVVV ASRGGDFQAT LRVWFLADAL GMAVIAPVLF VFLKDHVLTE
LFGREKLGGT ALMLLLLGLS LAVVFSQSRL PLLFLVFPVL VLAVFKRGFA GAALAVVTIA
AIGVIGAVKG FGPITIGSGN AFTDRVLLLQ AYLACVVATS FPLAAVLGER DRLHERLTGL
VYVDWLTDLP NRRHFDMRFN SEWRRAMRSR MPISLMMIDV DQFKEYNDAY GHMAGDRCLA
KLGSVMAASV KRSADFVARY GGEEFVVILP ETTATGAGIV AANVMEAVAA LELPHAGSPH
KQVSVSVGIA YAHPDIGAEP SELVRSADRA LYTAKRDGRN CVRVALEQTT SA