Gene Caci_2566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2566 
Symbol 
ID8333915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2905713 
End bp2907263 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content70% 
IMG OID644955719 
Productextracellular repeat protein, HAF family 
Protein accessionYP_003113325 
Protein GI256391761 
COG category[S] Function unknown 
COG ID[COG5563] Predicted integral membrane proteins containing uncharacterized repeats 
TIGRFAM ID[TIGR02913] probable extracellular repeat, HAF family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTCGT CCGGCAGGCG TTCGGCAGGC GCCGTCGCCC GTGTCGCAGT ACTCTGCGCC 
GCTCTGGTCG TGCCCGTCCC GTTCGCGGTA TCGGCATCGG CTGGTGTGGC CGGCGCGGCC
AGTGCGGCCA GTGCGGCCAC CGCGCCGCTC TCCACCATCA CCGACCTCGG CACGCTCGGC
GGTGATCTCA GCATCGCCAA CGGGATCAAC AACGCCGGGG TCGTCGTCGG CTACAGCGAT
CTGGCTTCCG GCACTCAGCA CGGCTTCCGC TGGTCCGGAG GGACCATGTC CGACCTGGGG
GTCGAGGCCG GGGGCGGGGA CAGTGTGGCG AACGCCGTCA ACGACGCCGG TCAGATCGCG
GGCCAAGCGA CGCGCGCCGA CGGCGGTTAC GCGTATCCGG TCCGCTGGAG CGCCGCCGGC
GTGCTGCAGG ACCTCGGCGG CCCGATCACC AACCGGCTGG GCGTCGGCAA CGCCATCGAC
CCCTCCGGCC GCGTCGCCGG CGGTCAGCGT CCGGCCGACT CCGAGGGCAG CCCGGAGGCG
ATCGTCTATG ACGCCGCCGG CAACCCCACC GAGCTGAGTA CGCCGACGCA GACCCTCAAC
GCGGCCACCG GCATCAACGC GCGCGGGCAG GTCGTCGGCG GTCCGGCGTT CGTCTGGCAG
AACGGGTCCC TGACCATGCT GCCGGTGCTG CCCGGCGGTC AGGGCGGATC GGCCAACGCC
ATCAACGTCT CCGGCACGAT CGTCGGCACC GTCAGCCGAA CCGGCACGCT CAGCGGTCTG
GACGCCGCGC TCTGGCAGAA CAACACCCTG ACAGACCTCG GCACGATCGA CGCGATCCAG
TACAACCAGG CGACCGCGGT CAACGCCGCG GGCCAGATCG TCGGTACCGC CGACCCCGAG
TGTCAGCCGT GCGCCGCACC GGAGGCGTGG CTGCGCCAGC CGGGCGGCGC GCTGACGAAG
CTGGACACGC TGCTCCCCGC CGGCTCCGGC TGGACCCTCC AATCAGCCAC CGGGATCAAC
GACCGCGGCC AGATCGTCGG CGTCGGCCTC CACAACGGCC ACAAGCGCGG CTACCTGCTC
ACCCCGGCGT TCGCCGCGAC CGTGAACTTC GAACCGGCCG GCTCGACGAT CCCGGTGGGC
TACGCGGCGG ACACCGGCGC CGCGTACGGT GCGCGGTCCG GCGGCCTGAC CTACGGCTGG
AACATCGACA ACTCCGTGAA CACGAGGGAC CGCAACGCCT CGAGCTCCCC GGATCAGCGC
TATGACACGC TGATCCACAT GGAGCGCAGC GGAAGCGCGA CGGTGTGGGA GATGGCGGTG
CCGAACGGCC ACTACACGGT GCACCTGGTC TGCGGCGATC CGTCGAACAC CGACAGCGTC
TACAAGGTCA ACGTGGAGGG CGTGCTCACA GTCTCAGGGA CGCCGAGCGC CGCCAGCCAC
TGGATCGAGG GGACCAGCCA GGTCACGGTC TCCGATGGCA AACTGACCAT CACCAACGCC
ACCGGATCGA GCAACGACAA GCTCGCTTAC GTGGACGTCA TCGCTTCCTG A
 
Protein sequence
MHSSGRRSAG AVARVAVLCA ALVVPVPFAV SASAGVAGAA SAASAATAPL STITDLGTLG 
GDLSIANGIN NAGVVVGYSD LASGTQHGFR WSGGTMSDLG VEAGGGDSVA NAVNDAGQIA
GQATRADGGY AYPVRWSAAG VLQDLGGPIT NRLGVGNAID PSGRVAGGQR PADSEGSPEA
IVYDAAGNPT ELSTPTQTLN AATGINARGQ VVGGPAFVWQ NGSLTMLPVL PGGQGGSANA
INVSGTIVGT VSRTGTLSGL DAALWQNNTL TDLGTIDAIQ YNQATAVNAA GQIVGTADPE
CQPCAAPEAW LRQPGGALTK LDTLLPAGSG WTLQSATGIN DRGQIVGVGL HNGHKRGYLL
TPAFAATVNF EPAGSTIPVG YAADTGAAYG ARSGGLTYGW NIDNSVNTRD RNASSSPDQR
YDTLIHMERS GSATVWEMAV PNGHYTVHLV CGDPSNTDSV YKVNVEGVLT VSGTPSAASH
WIEGTSQVTV SDGKLTITNA TGSSNDKLAY VDVIAS