Gene CA2559_05940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_05940 
Symbol 
ID9296684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp1334393 
End bp1336837 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content38% 
IMG OID 
ProductInorganic H+ pyrophosphatase 
Protein accessionYP_003715950 
Protein GI298207771 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.217127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTCAT TTATGATTTA CATGCCAATT GCAATGGCAG TTTTAGGTTT AATCTACATG 
TGGGTTAAAC AATCTTGGGT AATGAAACAA AACGCTGGAG ATGGTAAGAT GAAAGAAATT
TCAGACCATA TCTACGAAGG TGCACTAGCA TTTCTTAGCG CAGAGTACAA ATTACTTACA
ATTTTTGTTG TTGTAGTAAG TGTTCTGCTC GCTATAGTTT CAATTGTTGT ACCTACTACC
CATTGGTTAA TTGTAGTTGC ATTTATATTT GGAGCAGTCT TTTCTGCTTT CGCAGGAAAT
ATAGGTATGA AAATCGCTAC TAAAACCAAT GTAAGAACTA CGCAGGCAGC AAAAACAAGT
TTGCCGCAAG CCCTTAAGAT ATCTTTTGGT GGCGGTACTG TAATGGGATT AGGTGTAGCC
GGTTTAGCAG TGTTAGGGTT AACAGCATTT TTTATCATTT TCTTCCATAC GTTTATGGGC
GCTTCTTGGA CCAATACTAT GGATATGACA ATAGTATTAG AAACCTTAGC AGGATTTTCA
TTAGGTGCAG AGTCTATTGC ACTTTTTGCT CGTGTTGGTG GTGGTATCTA TACTAAAGCA
GCAGATGTTG GCGCAGATTT AGTTGGTAAG GTAGAAGCTG GTATTCCAGA AGATGATCCT
CGTAATCCTG CTACAATAGC AGATAACGTT GGAGATAATG TTGGAGATGT TGCAGGAATG
GGAGCAGATT TATTCGGGTC TTACGTAGCA ACAGTTCTAG CAGCAATGGT TCTTGGTAAC
TATGTGATAA AAGATATGGG TGGCGCTATT ACAGATGCTT TTGGCGGTAT TGGCCCAATT
CTTTTACCAA TGGCAATTGC AGGTGTTGGT ATTATCATTT CTATCATTGG TACTATGCTG
GTTAAAATTA ACAGCAATGA GGCTAAAGAA GATAAAGTTA TGGGTGCGTT AAACTTAGGA
AATTGGGTGT CAATTATATT GGTGGCTATT TCTTGTTTTG CATTAGTGAC TTGGATGTTG
CCAGAAACTA TGAAAATGGA ATTCTTTGGT GAAGGCCTTC AAGATATTTC GTCTATGCGT
GTGTTTTATG CAACCTTAGT AGGTCTTTTT GTAGGAGCAG TTATCTCTTC TGTAACCGAA
TTTTATACAG GTCTTGGTAA AAAACCAATA TTAAAAATAG TACAACAGTC TAGCACAGGT
GCAGGAACAA ATATTATTGC AGGATTAGCA ACGGGTATGA TTTCTACATT CCCATCTGTA
TTGTTATTTG CTGGTGCCAT TTGGGCTTCA TATGCATTTG CAGGATTTTA TGGTGTGGCA
TTAGCCGCTT CAGCAATGAT GGCAACAACG GCTATGCAAT TAGCAATTGA TGCATTTGGA
CCAATATCAG ACAATGCTGG TGGTATTGCA GAGATGAGTG AGCAAGAACC AATAGTACGT
GAGCGTACAG ACATTTTAGA CTCTGTTGGT AATACAACGG CAGCAACAGG AAAAGGTTTT
GCTATTGCTT CTGCAGCATT AACTTCATTA GCACTTTTTG CGGCCTATGT TACTTTTACA
GGAATAGATG GTATAAACAT TTTTAAAGCT CCGGTTTTAG CAATGTTGTT TGTAGGAGGT
ATGGTTCCAG TTGTATTTTC TGCCTTGGCT ATGAATGCCG TAGGAAAAGC AGCTATGCAA
ATGGTACAAG AAGTACGCAG ACAGTTTAGA GATATTCCAG GAATTATGGA AGGTACAGGA
AAACCACAAT ATGATAAATG TGTCGAAATT TCTACTCAGG CTTCTTTAAA AGAAATGATG
TTGCCAGGTT TATTAACTAT AGGCTTTCCA CTTATTATTG CTTTTGTGCC AATGCTTTTT
GGAATGAGCA CTTTAGCAAT TGCAGAAATG CTTGGAGGTT ATATGGCAGG TGTTACAGTT
AGTGGCGTGC TTTGGGCAAT TTTCCAAAAC AATGCTGGTG GTGCTTGGGA TAATGCTAAG
AAATCTTTTG AAGCTGGTGT GATGATTAAT GGAAAAATGA CCTATAAAGG TAGTGATGCT
CATAAAGCAG CAGTTACTGG AGATACTGTT GGAGATCCTT TCAAAGATAC TTCTGGTCCT
TCAATGAACA TATTAATAAA GCTAACTTGT CTTATAGGAT TAGTGATAGC ACCTATATTA
GGCGGTCATT CAGCTGAAAC TGATATAGAT GAAGTTGAAA CTATTGAGGT TAGTGAAAAC
GTTATTATTG AAGGCACTCA AGACCTTAAT AACAAAACTG AAACTGTAGT GGAGATGGTA
AAAAATGACG CTACCGGAGA AGTTGTTGCT AACGTTGAAA TTAGAAGAAC AGTAAATGGC
GAAACAACTA TCGAAGAGAA GTCTTTTAGA GGTACAGAAG AAGAAGTACG CTCTCAGTTA
AAAGATGTAG ATGGTATTAA AATTAAGGTG AAAGATAAAG AGTAA
 
Protein sequence
MESFMIYMPI AMAVLGLIYM WVKQSWVMKQ NAGDGKMKEI SDHIYEGALA FLSAEYKLLT 
IFVVVVSVLL AIVSIVVPTT HWLIVVAFIF GAVFSAFAGN IGMKIATKTN VRTTQAAKTS
LPQALKISFG GGTVMGLGVA GLAVLGLTAF FIIFFHTFMG ASWTNTMDMT IVLETLAGFS
LGAESIALFA RVGGGIYTKA ADVGADLVGK VEAGIPEDDP RNPATIADNV GDNVGDVAGM
GADLFGSYVA TVLAAMVLGN YVIKDMGGAI TDAFGGIGPI LLPMAIAGVG IIISIIGTML
VKINSNEAKE DKVMGALNLG NWVSIILVAI SCFALVTWML PETMKMEFFG EGLQDISSMR
VFYATLVGLF VGAVISSVTE FYTGLGKKPI LKIVQQSSTG AGTNIIAGLA TGMISTFPSV
LLFAGAIWAS YAFAGFYGVA LAASAMMATT AMQLAIDAFG PISDNAGGIA EMSEQEPIVR
ERTDILDSVG NTTAATGKGF AIASAALTSL ALFAAYVTFT GIDGINIFKA PVLAMLFVGG
MVPVVFSALA MNAVGKAAMQ MVQEVRRQFR DIPGIMEGTG KPQYDKCVEI STQASLKEMM
LPGLLTIGFP LIIAFVPMLF GMSTLAIAEM LGGYMAGVTV SGVLWAIFQN NAGGAWDNAK
KSFEAGVMIN GKMTYKGSDA HKAAVTGDTV GDPFKDTSGP SMNILIKLTC LIGLVIAPIL
GGHSAETDID EVETIEVSEN VIIEGTQDLN NKTETVVEMV KNDATGEVVA NVEIRRTVNG
ETTIEEKSFR GTEEEVRSQL KDVDGIKIKV KDKE