Gene Acid345_4407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4407 
Symbol 
ID4073313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5234513 
End bp5235469 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content57% 
IMG OID637986440 
Producthypothetical protein 
Protein accessionYP_593481 
Protein GI94971433 
COG category[R] General function prediction only 
COG ID[COG1611] Predicted Rossmann fold nucleotide-binding protein 
TIGRFAM ID[TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGAGA GACTCTGGCG AACAACTTTC CGCGGCTATT CACGTTCTGT ATGCTTGGAA 
GGCTTTCTAA GTACAAAGAC CATGGCCGAC AAGACGCCCA AACCCTTTCG CTCGCCGCTG
GCTTATGAAA ACCCAAAATT TGTAAATGGA CCGGATGGCC GACCGCTTCG CATTATGTCG
GAATATGCGG AACCGCTGGC GCGGTTTCGG CGGGAACGCA TCCAGGACAC GGTGGTGTTC
TTCGGATCGG CGCGGTGGCA TTCGCGTTCG ATCGCGGAAG AACATCTGCA ACTGTTGGAG
AAGCCGGGTT CGGCGCAGCC GGCGCCTCCA GAAGAACAGC TGCGGTTGAA GATGGCGCGG
GCCGATGTGG AGATGGCGCG CTATTACGAA GACGCACGCA AGCTGGCATA CATGCTGGCG
GGCTGGAGCA AAGATCTTGG TGGCAGGCGG CATCGGTTTG TAGTCACCAC CGGCGGCGGA
CCAGGGATCA TGGAGGCAGC AAATCTTGGT GCACATGAAG CGGGCGCGAA GACGATTGGG
CTGAATATTC GTCTGCCGTT CGAGCAGATG CCGAACCCGT ACATCACGCC GGAGTTGAAC
TTCGAGTTCC ATTACTTCTT CATGCGCAAA CTGTGGTTTG CGTATCTCGC GAAGGCGCTG
GTGATTTTCC CCGGCGGCTT TGGGACCTTC GACGAACTGT TCGAGATCCT GACCCTGGCG
CAGACGGAGA AGATGGCGAA GAAAATCTTC GTGGTGATAT ATGGCACCGA GTACTGGAAA
AAGGTGATCA ACTTCCAGGC ATTCGTGGAT GCGGGTGCGA TCGCGCCGGA CGATCTCAAT
CTGTTCAAGT TTTGCGATGA TCCGCAGGAG GCATTCGAGT ATCTGCGGGA CGGGCTGACG
GAGTTCCACC TGGGAAACAC GCACAAGAGC CCGGAGATTG CGAAGACGAG ACTGTAG
 
Protein sequence
MPERLWRTTF RGYSRSVCLE GFLSTKTMAD KTPKPFRSPL AYENPKFVNG PDGRPLRIMS 
EYAEPLARFR RERIQDTVVF FGSARWHSRS IAEEHLQLLE KPGSAQPAPP EEQLRLKMAR
ADVEMARYYE DARKLAYMLA GWSKDLGGRR HRFVVTTGGG PGIMEAANLG AHEAGAKTIG
LNIRLPFEQM PNPYITPELN FEFHYFFMRK LWFAYLAKAL VIFPGGFGTF DELFEILTLA
QTEKMAKKIF VVIYGTEYWK KVINFQAFVD AGAIAPDDLN LFKFCDDPQE AFEYLRDGLT
EFHLGNTHKS PEIAKTRL