Gene Acid345_2095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2095 
Symbol 
ID4069694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2507498 
End bp2508802 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content60% 
IMG OID637984110 
Producthypothetical protein 
Protein accessionYP_591170 
Protein GI94969122 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.174556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGGCC CGATGCTCTG CAGCCGCCAC CGCGAACAGT CTAGTAGTCC TACCTCGATC 
TCGCGACTCG CGATACCATG CAGCGAGATG TCCGCACCCC GACTGCAGGC CTATCCAAAG
GCTTCGACTC CGAAACGCAC TTCGCGTTGG CTGATTGTTT TATTAGTTGT TGCCACGCTG
GTGGCGATTG CGTGGTATCC GGTCTACTTG CACCTGCGAG CGGTCGCGGT TCTGCTGCGG
ATCGAATCGT CGGAAGCGCA TGGATTCTTC GCGAACCGCG GCATTCACCC GATCAAAACT
GAAGAATCGA CGTTCGCGGC GGATAATCGC GGGCTTCGCA CGCGGCTGTA CGTGCCAACG
GATTTGAAGA GCGTGCCGGC GATGGTGATC GTGCATGGCG TCCATCATCT CGGATACAAC
GAACCGCGGT TGGTGCGCTT TGCGAAGGCG ATCTCGGGGG CAGGGATGGT CGTCTCGACG
CCAGAGTTAC CGGAGATAGC GGGATACGAG ATCAAGCCGG TCTCCATCGA GGAAATTGCC
GCAGCTGCGG ATGATCTTGC GGCACGCATG CATTCGCCGT GCGTGGGAGT GCTCGGGTTG
AGCTTCGCGG GAGGGCTCGC GCTTTCGGCG GCAAGCGATC CTGCGACCTC GCGGCATATC
TGTTATGTGG TGGCGATTGG CGCCCACGAC GACATGTCGC GGGTGATGAA GTTCTTCGCC
ACCGACCGCG CCGAATATCC CGATGGCCAC TCGCAGTCGA TGCCCTCGCA TGAGTACGGA
GCGCTGGTCG CGATCTACGC CCATCCTGAA GAATACTTTC CGGCCGCGGA CGTTCCGCTG
GCGCGTGAGG CGATCCGGCA GCAACTCTTC GAAGAGATCG TGAACGCGAA GGCTACGGCT
GCAAAGATGT CTCCGGAAGG GCAGGCGACG ATGCAGATGC TGCTGGCGCA GAACCATTCG
GTGGTGAACA AGATCCTGCT GGCGAACCTC GACAAGCACC GAGACGAAGC CGAACAGGTT
TCGCCGGGTC CGGAACTCTA CCGTCTGCAC GTTCCTGTTT TATTGCTGCA CGGCGCGGGA
GACAACGTGA TTCCGCCGTC GGAGACGCTA TGGCTTGCTA AAGATATTCC TGCCCAGAAC
CTGCGGGCGG TGCTGATCAG CCCGGCGATC AGTCACGTGG AGGTTGGGAA AGGCGCCACG
CTGATGGACA AATTCCGGCT TGTCCACTTC ATCGTGGTGC TGTTGGGAGA GGCCAAAGAC
GCGCCTTACA ACACGGTTGA GCTGCAACGC GTTGGCGGCG GGTAG
 
Protein sequence
MSGPMLCSRH REQSSSPTSI SRLAIPCSEM SAPRLQAYPK ASTPKRTSRW LIVLLVVATL 
VAIAWYPVYL HLRAVAVLLR IESSEAHGFF ANRGIHPIKT EESTFAADNR GLRTRLYVPT
DLKSVPAMVI VHGVHHLGYN EPRLVRFAKA ISGAGMVVST PELPEIAGYE IKPVSIEEIA
AAADDLAARM HSPCVGVLGL SFAGGLALSA ASDPATSRHI CYVVAIGAHD DMSRVMKFFA
TDRAEYPDGH SQSMPSHEYG ALVAIYAHPE EYFPAADVPL AREAIRQQLF EEIVNAKATA
AKMSPEGQAT MQMLLAQNHS VVNKILLANL DKHRDEAEQV SPGPELYRLH VPVLLLHGAG
DNVIPPSETL WLAKDIPAQN LRAVLISPAI SHVEVGKGAT LMDKFRLVHF IVVLLGEAKD
APYNTVELQR VGGG