Gene Acid345_3884 details


Gene Information

Locus tag: Acid345_3884
Symbol:
ID: 4072219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4591970
End bp: 4593973
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 61%
IMG OID: 637985908
Product: adenylate/guanylate cyclase
Protein accession: YP_592958
Protein GI: 94970910
COG category: [T] Signal transduction mechanisms
COG ID: [COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain)
TIGRFAM ID:
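
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are inclusive and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4591970
end_bp = 4593973
gene_length_bp = 2004
protein_length_aa = 667

# Assumption: coordinates are inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span)                     # 2004
print(codons)                   # 668
print(expected_protein_length)  # 667
```

Under these assumptions the span matches the reported gene length, and 668 codons minus one stop codon matches the reported protein length of 667 aa.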


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCTG CCCCCCGTAA ACAATCGCGC GCCCAACGAC TCTTCCAGCG CGGAGCCGCG 
CTCGCTGCCG TCTTCACGGC AGTGCTACTC ATTGGTGTCG AAAATACCAA GCCCATGCGG
TGGCTCGAAA CCGGCACCTA CGACGCGCGT ATGACGTGGA GCCTCGATCC CTCACGTGCC
GATAAAAGCA TCGTCATTCT CGACATCGAC AACCCCAGCT TCGAGATCCT GAAGGAAAGC
TTCGGACGCT GGCCCTGGAC GCGTATGGCA TGGGCCGGCG CCATTGATTA CATGGCCGAC
GGCCACCCGA AAGTCATCGC CTTTGACTTC AAGTTCGGTG GCAGCGAAGA CCCCAAAGTC
GACCAGGCCT TCGCCGAATC CATTCGCAGC GGCCGCAATG TTCTCCTCGG CTTCTCCTTC
GATCCCGCGC AGATCGAGGA CGTCTCCGAC GTGAAGAACC GCAAGCTCGC GCTCCTCGCC
CGCGAGTCTC TCAGCGACAG TCACATCCTC GGCGAACATT TTCCTCCCGC GGACCAGAGC
CTCAACGTTC CGCTCGACAT TCTCGCCAAA GCTTCCGCCG GTATGGGCTG CCTCAACGCC
GTTTATGACG ACGATGGCGC CGTGCGCCGC ATGCCGCTCG GCTGCAACTA TGGCGACCTC
GCCTTCCGTA CCCTCGACAC TCGTGCTGTG GACTACGTTC GTGGCCACGA TGCCAGCCGC
TTCATCCGCG ACGGCCGCTA TGGCGACAGC ACCGGCGAGC GTATTCCCGT CGATGCCAAC
GGTCATCTCC TGGTCTGGTT TCATGGTCGT CCGCACGGCA CTTACGAACG GGTTCCTTTC
TGGAAGCTGG TATGTTCCGC CGCGCCCGAC AGCTGTCCGA ACCTGAAGGA GCCGATTCCG
CCTTCTTACT TCAAGGACAA GATCGTCCTC GTGGGCGCCA GCGCATCGGG GAGCTTTGAG
CTTCATTCCA CGCCCGTAGG CGATGCGCCG GGTGTGTTCG ATCGCGCTGC TGCCATCGAC
AACCTGCTGC ATGGCGAAGG CATATTGATC GCGCCGCTCT GGCAACACTG GATCCTCATC
GCCATCATGG CGCTGGTCGG CTGGATGGTG CTCACGCGTC TCGGCGTTAG CGTCGTCGGC
GCGCCGGTAA TGCTGTCGAT CCTCGGCGTC TATGCGGTAT TGGCAGCTGC GGTCTTCCGG
GTCGAACATC TCTGGCTGCC GATGGTCTCG CCTTTGAGCG CGGGAGCGCT TGCGTATATC
AGCGCCGGTG GTGTGCGCTA TGTCACGACG GGCCGCGAAC TCCGCGCCAC GCGCCACGCA
CTGGAGCGCC ACATGTCGCC GCAACTCGCG CAATACGCGC TGGAACACGG CGATAACCTC
GCCGGCGAGC GCCGCGAGCT CACCATCTTC TTCTCCGACA TTCGCAGCTT TACCACGCTC
ACCGAGAGCC TCCGCGACCA TCCCGACAAG TTGCTCGCGC TCCTCAATGA ATACCTCACG
GCGATGTGCG AGGTCATCTT CAAGTACGAG GGTGTAGTCG ACAAGTTCAT CGGCGACGGC
ATTCTCGCGC ACTGGGGAGC GTTCACGGAG GGCAAGAACC ACGCGCTGCT CGCGGCGCAA
GCCTCACTCG AGATGCTCGA CCGCCTCAAG CAGATGAACG CCGATTGGGC GGCGCAAGGC
CGCGACCAAC TGGCCATCGG CATTGGTCTC AACACCGGCG AAGTGACCTT CGGTAACGTG
GGCGCAGGCC AAAAGACTGA GTTCACCGTG ATCGGTGATC CGGTAAACTT GGCGTCGCGT
TTAGAAGGCC TCAATAAAGA ACACCATACT TCCATCATCA TCAGCGAATT CACTCTCGAA
CACTTGCGCG GTCTTGTGCA GACGCGCGAA CTCGGCGGCG TAAAAGTTAA GGGCAAGACG
ATCGAAACGC AGATTTATGA ATTGCAGGGT TTAGCAACAT CGCAAGAACC GGTCCCGGCG
CAAACCGTCG GGTCACGAGC TTAA
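
The GC content field can in principle be recomputed from the gene sequence. The sketch below is illustrative and not part of the original record; `gc_fraction` is a hypothetical helper name. It strips the spaces and line breaks that group the bases into blocks of ten, then counts G and C. It is shown on the first ten-base block only; passing the full gene sequence would give a value to compare with the reported GC content.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to group bases into blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return gc / len(bases)


# First ten-base block of the gene sequence: 4 G + 2 C out of 10 bases.
first_block = "ATGGCTGCTG"
print(gc_fraction(first_block))  # 0.6
```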
 
Protein sequence
MAAAPRKQSR AQRLFQRGAA LAAVFTAVLL IGVENTKPMR WLETGTYDAR MTWSLDPSRA 
DKSIVILDID NPSFEILKES FGRWPWTRMA WAGAIDYMAD GHPKVIAFDF KFGGSEDPKV
DQAFAESIRS GRNVLLGFSF DPAQIEDVSD VKNRKLALLA RESLSDSHIL GEHFPPADQS
LNVPLDILAK ASAGMGCLNA VYDDDGAVRR MPLGCNYGDL AFRTLDTRAV DYVRGHDASR
FIRDGRYGDS TGERIPVDAN GHLLVWFHGR PHGTYERVPF WKLVCSAAPD SCPNLKEPIP
PSYFKDKIVL VGASASGSFE LHSTPVGDAP GVFDRAAAID NLLHGEGILI APLWQHWILI
AIMALVGWMV LTRLGVSVVG APVMLSILGV YAVLAAAVFR VEHLWLPMVS PLSAGALAYI
SAGGVRYVTT GRELRATRHA LERHMSPQLA QYALEHGDNL AGERRELTIF FSDIRSFTTL
TESLRDHPDK LLALLNEYLT AMCEVIFKYE GVVDKFIGDG ILAHWGAFTE GKNHALLAAQ
ASLEMLDRLK QMNADWAAQG RDQLAIGIGL NTGEVTFGNV GAGQKTEFTV IGDPVNLASR
LEGLNKEHHT SIIISEFTLE HLRGLVQTRE LGGVKVKGKT IETQIYELQG LATSQEPVPA
QTVGSRA