Gene Acid345_4542 details


Gene Information

Locus tag: Acid345_4542
Symbol:
ID: 4070221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5386154
End bp: 5387419
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 60%
IMG OID: 637986582
Product: hypothetical protein
Protein accession: YP_593616
Protein GI: 94971568
COG category:
COG ID:
TIGRFAM ID:
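
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself; it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 5386154
end_bp = 5387419
gene_length_bp = 1266
protein_length_aa = 421

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon, so it adds no residue.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, codons, residues)  # 1266 422 421
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.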


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00650799
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.54265
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCGG GCCTTTTTGC GCGGGAATTG GGCCCAGCGC AGAAGATCCT CGCGACGGCG 
GCTCTCGCCT TCGCGGCGTT CGGCTGCGCT CTGCACCCGT ACGTTTATCA CGACTTCCTC
ATCGTGCCGT ACTTCGCGGT GGGGCTTGCC TGCATTCTTA TTCTTCAACT GAGAGTCATG
CCCTCGGTAC GCGATGCTAT CGCGGTCGTC GTGCTTGGAT TGGCGCTGCT GCAGGTGGAC
CTGCGGCTGC TTGGGTACGC GACATCTGCG ATGGCGGTGT TGTCGTTGTT CGGGTTGGCG
AGTTTGCTGG TGCTGGGATG GCGTGCGATT TGGGGCAAAG CGAAAGCAGA CGCGTTGCAC
CGGGCTTTTG TGGCCGCAAT AGGGTTGGGC GTTTGCGTGG CATTTACGGG CCTCTATATC
GAGCGCAGTG CCTTCTGGCA GACGAAGATG TACGACCTGT TTTTGTATTC CTTTGACGCG
AGCCTGGGAG GACAGTGGAC GTTCCGGCTG GCGCAGTTTA CGGCGCACCA TCCAGGGGCT
CATTTTGTGT CGGCGATGGT CTACAACGTG GTGCTCGTGC CACCGGCACT GGTGTATGCG
GCGCTATTGA ATGACGAGCG TCGTGCGCGG ACTGCGCTCT GGGCGTTTCT GATTGTGGGC
CCGCTGGCGT GCGTGTGTTT CCTGCTCTTT CCAGCGACGG GGCCGGTGTA TGCGTTCAAG
ACCTTTCCGA TGTTGGCCGT TCCTGCTGGC GAGATCGCAC GACTGGTTCC AGGGCCGGTC
GGGATCAGCG GGCCGAGAAA TGCGATTCCA TCGTTGCACT TTGCGTGGGT ACTGCTGGCG
TATTGGAACT CGCGAGACAC GAAGGCAGCG ATTCGTGTTT TTTGTGCAGT GATGCTCGCG
CTGACGATCT ACGCGACGTT GGAGACGGGT GAGCACTACG GCGTGGATCT TCTGGTGGCG
GTGCCGTTCG CGCTGGGGAT CCAGGCGTTG GCGATGTGGC TGGGTGGGAT TCGAAGCCGG
TGCGTCACGC AGGCGATCTT TGTGCCGCTA GGGATCACCG TTGCATGGTT CGTGTTGCTG
AGGTTCTGCA ACCGCGTTTG TTGGGTTTCT GCGGTTGTGC CATGGGCAGC GGTGCTGCTA
ACGCTTGGAG CATGCCTGTA TCTGTATCGG CGGCTGGTGG CCGTGCAGAA GGAATCCGGC
TCTATCGAAA AGCAGAGCGT GTCGCGAGAG ACGACGGATT TGGTGCACGC AGGATCTGCG
GGCTAA
 
Protein sequence
MSAGLFAREL GPAQKILATA ALAFAAFGCA LHPYVYHDFL IVPYFAVGLA CILILQLRVM 
PSVRDAIAVV VLGLALLQVD LRLLGYATSA MAVLSLFGLA SLLVLGWRAI WGKAKADALH
RAFVAAIGLG VCVAFTGLYI ERSAFWQTKM YDLFLYSFDA SLGGQWTFRL AQFTAHHPGA
HFVSAMVYNV VLVPPALVYA ALLNDERRAR TALWAFLIVG PLACVCFLLF PATGPVYAFK
TFPMLAVPAG EIARLVPGPV GISGPRNAIP SLHFAWVLLA YWNSRDTKAA IRVFCAVMLA
LTIYATLETG EHYGVDLLVA VPFALGIQAL AMWLGGIRSR CVTQAIFVPL GITVAWFVLL
RFCNRVCWVS AVVPWAAVLL TLGACLYLYR RLVAVQKESG SIEKQSVSRE TTDLVHAGSA
G
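
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a hedged sketch rather than the tool that produced the record: it builds the codon table in the NCBI base order TCAG, where the amino-acid assignments of translation table 11 are the same as in the standard code, and it assumes the first codon is a valid initiator and reads it as methionine (which is why the leading GTG yields M). The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
# Codon table in NCBI order (T, C, A, G); '*' marks a stop codon.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break  # stop codon ends translation
        # Assumption: the first codon is a valid start and is read as Met.
        protein.append("M" if i == 0 else residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAGCGCGG GCCTTTTTGC GCGGGAATTG"))  # MSAGLFAREL
```

The ten residues printed here agree with the first block of the protein sequence. Passing the full gene sequence to the same function would be the natural way to compare against the whole listed protein.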