Gene CA2559_08051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_08051 
Symbol 
ID9297094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp1764457 
End bp1766382 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content31% 
IMG OID 
Productinter-alpha-trypsin inhibitor heavy chain H2-like protein, precursor 
Protein accessionYP_003716357 
Protein GI298208178 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTTTA AATACCCAGA ACTACTTTAC GCGCTTTTTT TACTGGTAAT TCCTATTATC 
GTACATCTTT TTCAGCTTCG TAAATTTCAG AAAGAAGAGT TTACCAATGT TAAGTTCCTC
CAACGCGTAA TCTTACAAAC ACGAAAAAGC TCTCAACTTA AAAAATGGCT TACATTACTT
TCACGTCTCT TATTAATGGC GTGTTTAATA ATTGCTTTTG CTCAACCTTT TTTTACTGCA
AATGATAATG CTACTAAACC ACAAGAGACT GTAATCTACT TAGATAACAG CTTTAGCATG
CAGGCAAAGG GGCAAAAGGG TGAACTTTTA AAAAGAGCCA TTCAGGAGTT ATTAGAAACA
CTTCCTGAAG ATGAAGTTTT TACTCTTCTT ACAAATACAG ATAGATTAAA AAATACAACG
TTAAGAGAGT CTCGAAATGA GATACAAGAA ATTACTTATG ATGCTCAAAG CATATCTGTA
AACACTGCTT TAATACGTGC TAAAAATGAG TTTTCAAAAA CCAAAGGTGT TGTAAAGAAT
TTTGTTGCTA TTTCAGACTT TCAAATTAAT GACGATCCAT TTCAGCAAAC TCCTGAGGAG
ATCTCTGTAA ACTTTGTACA ACTTTCACCT GTAAATAAAG ATAACCTTTC TGTCGATAGT
GTTTACGTTA AAGATCGTGG CATTAATTCT ATCACATTAG CTACAAAAAT ATCAAGTACT
GGTTCACTAT CAGGAACACT TCCTGTTGCT CTTTATGACG GTAACACACT TTTAGCCAAA
ACAAGTGTGC AACTTGAGGA AAACAGCACT TCAGAAACAG TTTTCAACAT ACAAAACCCA
GAAAGTATTA ACGGCAATAT TCGTGTAGAA GATACTGGTT TACAATATGA CAACATACTA
TATTTTAGCA TTAACAAACC AGATCCTATT AAGGTAATTG CAATTAGTGA TGTAGATGAT
AGTTATCTAA AAAAACTATA TAAATCACCA GAATTTGAAC TTATCACTTC AACAACTGCT
CAATTAGATT ACAACCAATT AAATAATGCT AACTGTATAA TTTTAAATGA AGTCTTGCAA
TTACCAAATG GGCTTGCTAC TATTTTAAAT AAATTAACAG CAGCAAACGG TACTGTAATA
ATTGTACCAG CACAAGAAGC TAATATAAAC AGTTACAATT CTTTATTTAA CCAATTAGGA
TTATCACCTT TATCTGAGTT AAAAAATCAG GAAAAACAAA TTACCAATAT AGTCTTTGCG
CATCCTATTT ATGAAGGTGT TTTTGATAAG CGTATAGATA ACTTTCAATA CCCTAAGGTA
CAATCGTATT TTACAACAAC CGCCAATGCT AACAAGGTTT TAGGCTATCA AGATAACTCA
AGTTTTTTAG AACAAACTGG CAATGTATAC AGATTTACAG CTGCATTAAA TTCTCAAAAT
TCAAATTTCA AAAATGCGCC ATTAATTGTT CCTACATTTT ATAATATAGC CAAACAAAGC
TTAAAAGGCG GAACCTTATA TTATACTATA AACAACGCAA ATAGTCTTGA TGTAAATGTG
GCATTACCAC AAGATGACAT CCTTAAAATT GAAAGTGAAA ATGGGTCTTT TATACCATTG
CAACAAAACT TTAATTCTAA GGTTCAGATT ACAACTAATG AGCTACCAGA GTTGGCAAAT
AATTATGTGG TTAAAAACGA AAATCAAGAA TTGCTTAAAC TAAGTTATAA TTATAACCGT
GACGAAAGTG AGTTAACTTA CCAGAATATT TCTCAACTCG AAAATATTAC AATTAACAAT
CAAGTATCTT CATTTTTTAA GCAAATGCAA CAAGACAATA GTATAACCGA TTTATGGAAA
TGGTTTGTTA TTTTTGCACT ACTATTTCTT TGTATAGAAA TACTTCTCTT AAAATTCCTG
AAATGA
 
Protein sequence
MQFKYPELLY ALFLLVIPII VHLFQLRKFQ KEEFTNVKFL QRVILQTRKS SQLKKWLTLL 
SRLLLMACLI IAFAQPFFTA NDNATKPQET VIYLDNSFSM QAKGQKGELL KRAIQELLET
LPEDEVFTLL TNTDRLKNTT LRESRNEIQE ITYDAQSISV NTALIRAKNE FSKTKGVVKN
FVAISDFQIN DDPFQQTPEE ISVNFVQLSP VNKDNLSVDS VYVKDRGINS ITLATKISST
GSLSGTLPVA LYDGNTLLAK TSVQLEENST SETVFNIQNP ESINGNIRVE DTGLQYDNIL
YFSINKPDPI KVIAISDVDD SYLKKLYKSP EFELITSTTA QLDYNQLNNA NCIILNEVLQ
LPNGLATILN KLTAANGTVI IVPAQEANIN SYNSLFNQLG LSPLSELKNQ EKQITNIVFA
HPIYEGVFDK RIDNFQYPKV QSYFTTTANA NKVLGYQDNS SFLEQTGNVY RFTAALNSQN
SNFKNAPLIV PTFYNIAKQS LKGGTLYYTI NNANSLDVNV ALPQDDILKI ESENGSFIPL
QQNFNSKVQI TTNELPELAN NYVVKNENQE LLKLSYNYNR DESELTYQNI SQLENITINN
QVSSFFKQMQ QDNSITDLWK WFVIFALLFL CIEILLLKFL K