Gene Acid345_0902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0902 
Symbol 
ID4069113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1129639 
End bp1131156 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content53% 
IMG OID637982909 
ProductAlpha-L-fucosidase 
Protein accessionYP_589979 
Protein GI94967931 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.13799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGC CGCTGGTGGT GCTCGCATTT CTTGCTATTG GAACTCTGTT AACCGAAGCG 
CAGGAGCAAC CTGCAAATGT TCAACGCGAA CTGAAGTCCA TTGACCACAC TGTCGCTGCA
GGGCCGTTCC GCGATGACTG GAATTCCCTG AAACAAATGA AAGTTCCCGA TTGGTTTCGC
GACGCAAAAT TCGGGATCTT CATTCACTGG GGTGCGTACT CCGTGCCGGC GTGGGGCAAC
GAGTGGTATC CGCGAAACAT GTATCAGAAG AAAGAGCCGG AGTTCGCTCA CCACATCGAG
ATTTACGGGC CGCAGAACAA ATTTGGCTAC AAAGATTTTC TGCCACAATT CACGGGAGAC
AAGTTCGACG CCGATCAGTG GGCCGACCTG TTTCGTCGCG CAGGAGCGAA GTACGTGATC
GAAGTCGCCG AGCATCATGA CGGCTTCGCG ATGTACGACA CTCAATTCAG CGAGTGGAAA
GCTACAAAGA TGGGACCGCA TCGTGACGTG ATCGCCGAAC TTGGCCCGGC GGTTCGCAAG
GCTGGATTGC ATTTCGGAAT CTCGTACCAC CGCGCGGAGC ACTGGTGGTT TTTCGATGGC
GGCAGAACGT TCGACTCCGA TGTAAACGAC CCGAAGTATG CGAGTTTTTA CGGCCCGGCC
CAGCCTCAAC GCTTGCCTGG TGCAGATCGT GACAATCAGC CAGATGCCGC GTTCCTTTCC
GACTGGCTTG CCCGCAGTGG AGAAATTGTC GAGAAGTATC ACCCGGACCT CTGGTATTAC
GACTGGTGGG TCGGCCAGCC GGCGTTCGAA CCCTATCTGC GGAAATTCGC AGCGTTCTAT
TACGACGAAG CAGCCAAGGC GCAGCAGCCG GTTGTCATGT ACTACAAGGA AAAAGCAATG
CCGCAAGGCA CTGCGGTTCA CGACGTCGAG CGTGGCAAAC TCGCGGCCAT CCAACCCGAA
CCTTGGCAAA CTGACACTTC GATCAGCTAC AAATCATGGG GCTACCTGAA AGATGACTCG
CTGCGAACGC CAAAATCCAT CGTGCAAGAC TTGGTTGATA TCGTTAGCAA GAATGGGTGC
TTGCTATTAA ATGTCGGTCC GAAGCCAGAT GGAACGATTT CAGAGGACGA GCAGAAGATC
TTGCTCGCAG TCGGAGAGTG GCTCTCTCTG AACGGAGAGG CTATTTACGG CTCTCGTCCG
TGGACGACTT TCGGCGAAGG TCCGACCCAG GTGAAAGAAG GCAGCTTAAA CGAAGGCGAG
CAGAAGCCGT TCACTTCGGA CGATATACGC TTTATTCGGA ACCATGAGGC GCTGTATGCA
ATTTCGATGG ATTGGCCGAA AGATGGAAGC ATACGGATAC GGTCACTCGA CAAGGAATTG
AAAGTGAGGA GCGTGTTGCT GGTGGGATCG GCAGGCAAGG TATCTTTCAA ACAAGATGCC
GAGGGGCTTT GGGTCGAACT GCCTAAGGAT ATAAGTGATG AACTGCTTCC GGTTTTGAAA
ATCGAATTTC GCCAATAG
 
Protein sequence
MNKPLVVLAF LAIGTLLTEA QEQPANVQRE LKSIDHTVAA GPFRDDWNSL KQMKVPDWFR 
DAKFGIFIHW GAYSVPAWGN EWYPRNMYQK KEPEFAHHIE IYGPQNKFGY KDFLPQFTGD
KFDADQWADL FRRAGAKYVI EVAEHHDGFA MYDTQFSEWK ATKMGPHRDV IAELGPAVRK
AGLHFGISYH RAEHWWFFDG GRTFDSDVND PKYASFYGPA QPQRLPGADR DNQPDAAFLS
DWLARSGEIV EKYHPDLWYY DWWVGQPAFE PYLRKFAAFY YDEAAKAQQP VVMYYKEKAM
PQGTAVHDVE RGKLAAIQPE PWQTDTSISY KSWGYLKDDS LRTPKSIVQD LVDIVSKNGC
LLLNVGPKPD GTISEDEQKI LLAVGEWLSL NGEAIYGSRP WTTFGEGPTQ VKEGSLNEGE
QKPFTSDDIR FIRNHEALYA ISMDWPKDGS IRIRSLDKEL KVRSVLLVGS AGKVSFKQDA
EGLWVELPKD ISDELLPVLK IEFRQ