Gene Caci_4136 details


Gene Information

Locus tag: Caci_4136
Symbol:
ID: 8335490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4675756
End bp: 4677390
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 68%
IMG OID: 644957239
Product: Curculin domain protein (mannose-binding) lectin
Protein accession: YP_003114841
Protein GI: 256393277
COG category:
COG ID:
TIGRFAM ID:
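
The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and do not come from any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length(4675756, 4677390)
print(length, protein_length(length))  # 1635 544
```

Both values match the Gene Length and Protein Length fields listed in the record.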


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.489882
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTCT CCCGACGCAC CCTGCTGTCC TCAGCCCTGG GCGGCGCAGC GCTGGCCACC 
GTGGCCGGGA CCGAACTGAC CTCGGCGTTC GCCGGTCCGG CGCCGGCGGC GGCGCCGGCC
GCCTCGCCCG CCGGCGACGT GGTCGGCAAG ATCACCGTCG GCTACCAGGG CTGGTTCGCC
TGCATCGGCG ACGGAGCGCC GATCAACGCC TGGTGGCACT GGAGCCAGAA CCAGGGGCAG
GCGCCCTCGC CGAGTAACCA GAACCTCAAG GCGTGGCCCG ACATGAGCGT CTACAGCGCG
GGCTACCGGA CCGGATTCGC CAATCTGGGC AACGGTTCCG CGCCGAACTT GTTCTCCTCC
TACGACCAGT CCACGGTCAA TGCCCACTTC TCGCTGATGC AGCAGAACGG CTGCGACACC
GCGGCGCTCC AGCGCTTCAA CCCCAACGGC AGCGAGGGCC CGACCCGCAA CGCGGTCACC
GCGAAGGTCA ACACCGCCGC GCAGCAGTAC GGCCGGAAGT TCTACATCAT GTACGACGCC
TCCGGCTGGA CGAACATGAA GACCGAGATG CCCGCCGACT GGACGAACGT GATGAAGCAG
TACGCCTCGT CCCCGGCGTA CGCGCACCAG AACGGCAAGC CGGTCGTCGG CATCTGGGGC
TTCGGCTTCA ACGACGCCAA CCACCCCTGG TCCGCCGCCG ACTGCCTGTC GGTCGTGCAG
TGGTTCCAGA GCCAGGGCTG CTACGTGATG GGCGGCGTGC CGACGTACTG GCGCACCGGC
GTGAACGACT CGCGCTCCGG ATACAGCGGC GTCTACTCCG CGTTCAACAT GATCTCGCCG
TGGATGGTCG GGCGCATCGG CTCGGTCAAC GATTCGAACA ACTTCTATAC GAACGTCAAC
GTCGGCGACC AGTCCTACTG CAACTCCCAC AACATCGACT ACCAGCCGTG CGTCCTGCCC
GGCGACCTGT CGGCCCGCCA GCGCGCGCAC GGCGACTTCA TGTGGGCGCA GTTCTACAAC
ATGGTGCGCG TCGGCGCGCA GGGCATCTAC ATCTCCATGT TCGACGAGTA CGGCGAGGGC
AACCAGATCC TCAACACCGC GCCGACCCAG GCGTTCGTGC CGACCAACTC CGGGCTGCTC
TCCCTGGACG AAGACGGCAC GGCGTGCAGC GCGGACTACT ACATGCGCCT GACCAACGAC
GGCGGCAGGA TGCTCAAGGG ACAGATCGCG CTCACCCCGA CACGTCCCAC CGCGCCCGGC
GGCACCAACG GCGGTGGCGG CGGGACCGGC GGCGGGTGTG GCCAGCTGAC CGCCAACCAG
CAGCTCACCG CGAACCAGTC CACGCTCTCC TGCGACGGCC GGTTCAAGCT GATCCTGCAA
GGCGACGGCA ACCTGGTGCT CTACCAGGGC AGCGCGGCGC TGTGGGCGTC GAACACCGTG
GGGAAGGCCG CCGCCAAGGC CGTCCTGCAG GGCGACGGCA ACTTCGTGAT CTACGACACC
GGCGGCGCGC CGCTGTGGGC CAGCAACACG GCCGGGAACA ACGGCGCGCA CCTCACGGTG
CAGAACGACG GCAACACCGT CATCGTCAGT TCCGCCGGCG CGACCCTGTG GAGCACCGGG
ACCGGCGGAC ACTGA
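
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any computation. The following sketch shows one way to do that and to compute a GC fraction that can be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the input contains only base letters and whitespace.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence, used here as a small input.
print(clean_sequence("ACCGGCGGAC ACTGA"))  # ACCGGCGGACACTGA
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be checked against the reported value.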
 
Protein sequence
MTLSRRTLLS SALGGAALAT VAGTELTSAF AGPAPAAAPA ASPAGDVVGK ITVGYQGWFA 
CIGDGAPINA WWHWSQNQGQ APSPSNQNLK AWPDMSVYSA GYRTGFANLG NGSAPNLFSS
YDQSTVNAHF SLMQQNGCDT AALQRFNPNG SEGPTRNAVT AKVNTAAQQY GRKFYIMYDA
SGWTNMKTEM PADWTNVMKQ YASSPAYAHQ NGKPVVGIWG FGFNDANHPW SAADCLSVVQ
WFQSQGCYVM GGVPTYWRTG VNDSRSGYSG VYSAFNMISP WMVGRIGSVN DSNNFYTNVN
VGDQSYCNSH NIDYQPCVLP GDLSARQRAH GDFMWAQFYN MVRVGAQGIY ISMFDEYGEG
NQILNTAPTQ AFVPTNSGLL SLDEDGTACS ADYYMRLTND GGRMLKGQIA LTPTRPTAPG
GTNGGGGGTG GGCGQLTANQ QLTANQSTLS CDGRFKLILQ GDGNLVLYQG SAALWASNTV
GKAAAKAVLQ GDGNFVIYDT GGAPLWASNT AGNNGAHLTV QNDGNTVIVS SAGATLWSTG
TGGH
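
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in the permitted start codons), and the helper name `translate` is hypothetical rather than taken from any library. It assumes the input starts on a codon boundary and contains only A, C, G, and T.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence.
print(translate("ATGACGCTCT CCCGACGCAC CCTGCTGTCC"))  # MTLSRRTLLS
print(translate("ACCGGCGGAC ACTGA"))  # TGGH
```

The two outputs match the first ten and last four residues of the protein sequence. This gene begins with ATG, so the sketch does not handle the alternative start codons that table 11 permits.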