Gene Acid345_1273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1273 
Symbol 
ID4073243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1546887 
End bp1548116 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content60% 
IMG OID637983282 
Producthypothetical protein 
Protein accessionYP_590349 
Protein GI94968301 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACA GCACTTACAA CGTCTTTGCC CCGGCGAGCG GGGTTCCGAT CAAGGCCTGG 
ACCAAGGGTG TGCCCGTGGA AGATCAGGCA AAGCAGCAAC TGATGAACGT TGCGCAGATG
CCCTTCGTCT TCAAGTGGGT CGCCGCCATG CCTGACGTGC ACTGGGGAAT CGGAGCGACC
ATCGGCAGCG TGATCCCTAC CAAGGGCGCG ATCATCCCCG CTGCGGTTGG CGTGGATATC
GGCTGCGGCA TGATGGCGGT TCAGACCAGC TTGCACGCGA ACCAGTTGCC GGAAAACCTG
CACGCGATTC GCGAAGCGAT AGAGTTGGCT GTCCCGCACG GACGCACCAA CAACGGCCGC
GCTGGCGACC GCGGAGCGTG GAGTGATATT CCCGAGCGCA ACCTCAACAC TTGGGGCGAG
TTGAAGCCGC GTTATGACGC AATCCTCGCC AAGCATCCGA AGTTGGACCG CGGCAACCAC
GCGAACCATC TCGGTACGCT CGGCACGGGA AACCACTTCA TCGAGGTGTG CCTCGATGAA
TCGGACAACG TCTGGTTCAT GCTGCACAGC GGATCGCGCG GCGTCGGCAA CCGCATGGGA
AGTTACTTCA TTGAACTTGC CCGCAAGGAC ATGGAGCGCT TCTTCATCAC TTTGCCGGAC
CGCGACCTCG CGTACTTTCC GGAGCACACC GAGCACTTCC ACGAATACGT GGAAGCGGTC
GAGTGGGCAC AGGATTTCGC GCGCTGGAAC CGTCACTTGA TGATGCAGCA GATTGTGGAT
GCAGTCCGGC GTTCGGGGCA GGTGCCGGAG TTTACCGCTG AAGTACAGGC GATCAACTGC
CACCACAACT ACGTGGCCCG CGAGAACCAC TTCGGCGAAA ACATTCTCGT GACCCGAAAG
GGCGCGGTAC GTGCGCGCCT CGGCGATATG GGAATCATTC CCGGCAGTAT GGGAGCACGC
TCGTTCATTG TTCGCGGTAA AGGCAACGTC GAGAGCTTCC ATAGCTGCAG CCACGGTGCG
GGACGCGCGA TGTCACGCAA CGAGGCGAAG AAGCGCTTCA CCGTGGAAGA CCACGTACGT
ATGACCGCTG GTGTCGAGTG TCGCAAGGAT GCCGATGTGA TCGACGAGAC TCCGGCTGCG
TATAAGCCCA TTGACGACGT GATGGCAGCG CAGTCGGACT TGGTCGAGAT CGTCTACACC
TTGCACCAGG TGGTCTGCGT AAAGGGATGA
 
Protein sequence
MTDSTYNVFA PASGVPIKAW TKGVPVEDQA KQQLMNVAQM PFVFKWVAAM PDVHWGIGAT 
IGSVIPTKGA IIPAAVGVDI GCGMMAVQTS LHANQLPENL HAIREAIELA VPHGRTNNGR
AGDRGAWSDI PERNLNTWGE LKPRYDAILA KHPKLDRGNH ANHLGTLGTG NHFIEVCLDE
SDNVWFMLHS GSRGVGNRMG SYFIELARKD MERFFITLPD RDLAYFPEHT EHFHEYVEAV
EWAQDFARWN RHLMMQQIVD AVRRSGQVPE FTAEVQAINC HHNYVARENH FGENILVTRK
GAVRARLGDM GIIPGSMGAR SFIVRGKGNV ESFHSCSHGA GRAMSRNEAK KRFTVEDHVR
MTAGVECRKD ADVIDETPAA YKPIDDVMAA QSDLVEIVYT LHQVVCVKG