Gene Acid345_0519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0519 
Symbol 
ID4069939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp640368 
End bp641864 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content58% 
IMG OID637982524 
Producthypothetical protein 
Protein accessionYP_589598 
Protein GI94967550 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0828452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGAA TGGCGGTTAC CATCTCTGCA CTACTCTTTT GTGTCAGCAC GTCGCTGGCG 
CAACAAAACG CCACAACGGA CCCCGACCTC CAGGAGTTGA AACAACAGTT GCGAGATGTG
GTCTCCTCGC TTCAGGAGAC GCGTAGCGAG CTTAAGGAGT CGCAGCGGCA GATCCAGGCT
CTGCAAACAG AGGTAGCCTC TCTGCGCGCG ACGAACGCGC CGCCGAGCAA TACTCCCTCG
GCCTCGCCCG AGACCACTCC GACCTCAACC GAACTCGCGG ACCGTGTGAC CACTCTGCAA
GAGCAGCAGG CGCTGCTGTC GACGAGAGTT GACCAGCAAT ACCAGACGAA GGTTGAGAGC
GGATCGAAAT ATCGCGTACG TCTTTCTGGG ATGGTGCTGT TCAATGCCTC GGGAACTCGC
GGCGAAGTGG ACGATCAGGA TGTTCCGATG CTCGCCGAAG GGCACACACC TGGACACTCC
GGCGGAAATA TTTCAGCCAC CATGCGCCAG ACGTTCATTA ACCTCGACCT CTTCGGTCCT
GACCTGGCGG GTGCACGCAC GTCTGCCTCA ATGCAATTCG ACTTCATGGG TGGATTCCCG
AATACGCTTG ATGGCGTTGC GATGGGCATC GTGCGCATGA AGGTCGCGAA GGCCCAACTC
GACTGGCAGA ACTGGTCGTT GAGCGTCGGC CAAGACAAAC CGTTTATCTC ACCGTATTCG
CCGACCTCTC TCGCAACTAT CGGAACGCCT AGCTTTGGGT ATTCCGGAAA TCTCTGGACC
TGGACACCAC AAATCGTCGC CGAGCGCCGA TGGAAACCGT CGGAGAGCCT ATCAACCAAG
CTGCAATTTG GCATGCTTGA TCCCCTTAGT GGCGAACTGC CCGGGGATTC CTTTGGCCGG
TATCCGGAAT CTGGGGAACG GTCGCGCGTT CCGGCGTTTG CGGCGCGCCA GAGTTTCGAT
TTCGGAAGCG GTACAGAAAA ATCATCGATC GGGTTTGGCG GCTATTATGC GCGTCATGAC
TTCGACTTCA ACCGAACGGT TGACGGTTGG GCGGCAACGC TCGATTGGAA GGTCGCACTG
GGGCGTTATT TCGAAGATAG CGGCGCCTTC TACCGTGGGC GCGCGGTGGG CGGTCTTTGG
GGCGGCATCG GGACGACGGC GGTGATGGAC GGTTACCTAA GCGATCCGCT GACCCACGTC
TATCCGGTCA ACAGCATTGG GGGATGGTCG CAGCTTAAGT ACAAGCCTGC TCCCAAGTGG
GAAATTAACG CGGCCTTCGG CGAAGACAGT CCATTTGCGG CTGACCTTCG GCTGGACAAC
GCACCATACT CGTACCGCCC GTATTTGCGG AACTGGACTA CTATGTTCAA TGTGATCCAG
CGGCCTCGAT CGAACCTTAT GTTCTCGCTC GAGTACCGCC ACCTGAATAG CGTCGAATTC
AGCGGCCAAC GAGATACGGC CGAGCACGTC AATCTAGGTG TAGGAGTGAT CTTCTAA
 
Protein sequence
MRRMAVTISA LLFCVSTSLA QQNATTDPDL QELKQQLRDV VSSLQETRSE LKESQRQIQA 
LQTEVASLRA TNAPPSNTPS ASPETTPTST ELADRVTTLQ EQQALLSTRV DQQYQTKVES
GSKYRVRLSG MVLFNASGTR GEVDDQDVPM LAEGHTPGHS GGNISATMRQ TFINLDLFGP
DLAGARTSAS MQFDFMGGFP NTLDGVAMGI VRMKVAKAQL DWQNWSLSVG QDKPFISPYS
PTSLATIGTP SFGYSGNLWT WTPQIVAERR WKPSESLSTK LQFGMLDPLS GELPGDSFGR
YPESGERSRV PAFAARQSFD FGSGTEKSSI GFGGYYARHD FDFNRTVDGW AATLDWKVAL
GRYFEDSGAF YRGRAVGGLW GGIGTTAVMD GYLSDPLTHV YPVNSIGGWS QLKYKPAPKW
EINAAFGEDS PFAADLRLDN APYSYRPYLR NWTTMFNVIQ RPRSNLMFSL EYRHLNSVEF
SGQRDTAEHV NLGVGVIF