Gene Caci_6157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6157 
Symbol 
ID8337520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7085052 
End bp7087808 
Gene Length2757 bp 
Protein Length918 aa 
Translation table11 
GC content67% 
IMG OID644959258 
ProductFibronectin type III domain protein 
Protein accessionYP_003116852 
Protein GI256395288 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.622603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.669224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGAC GGAAACGTTG GATCGCGGGC GCCGCGTTCC TCGCCCTCAT CGCGGGCGGG 
AGCATCGGAG CCGCACAGGG GCTGCAGAGT ACGAGGCCGT CATCCGGCCC CGGCTCGTGC
ACCCTCATCG GCTGGAACCC GAGCACCGAC CCGACGAACG CCAAGAACCT GCCGGTCGGC
CAGCGGCCCC AGTCCTACCG GCCCGACAAC TTCGACTGCT CCAGCGCCAA GTTCGCCGCG
CTCGGCTCGG AGTTCACGAA GTTCCCGCAG CCGCACGACT TCTCGGTGAA CAACCGCATC
ACCTACGAGC CCGCCGCCGC CGGCAAGCCG GCCACGGCGA TCCAGCAGCC GGTGGCGGCG
GTGAACCCGC TGACGCCGTA CTTCCCGCCG TTCCAGCACT TCGTCATCGT CTACCGCGAG
AACCACACGT TCGACGACTA CCTCGGCGAC TGCGCCACCA CCATCTCGGC CTCCTGCAAC
GGCAAGGTGC AGAGCACCAA CCACATCAGC TCGGTGCCGG ACCTGCACAG CCTGGCCAAG
ACCTACGCGC TGTCGGACTC CTACAGCACG GGCACGCAGC CGCCCTCCGG TCCCAACCAC
TGGTGGCTGT TCTCGGCCCA GTCGCAGTCC AGCTCCCAGC AGCAGACCTA CCCCTCCACC
GGGACGGAGT TCGACCGCTT CCTGAGCAAC ACCAACGGTC CCACCAACGA GGGCACCAAC
GCCTGCACGG CGCCGACCGG CAACGGCAGC GGCTCCAGCC CGTACACGTT CGTCATGAAC
GGCGACTTCT ACTGGATGCT CAGCAGCGGC TCGGGCTACT GGAAGAACCC CGCCGACGGC
AAGACCGAGG TCCTGCCCCC CAACCGCCCG GGCACGAGCA TCCCGGAGGA GCTGCACTAC
AACGAGTACA CCTGCTCCAA CCAGAGCCTT CCGGACAGCA CCATCGCCAA CGACTACATC
AACTTCGTCA ACACCAACGG CCTGCCGGTC TACAGCTACG TCGAGCTGTT CAACGACCAC
CCGGGCACCT ACCAGGACAT CCCGGGCAAC GACACCGCGA CCAACAACGT GGTCAACGCG
ATCGAGAACA ACGCGACGTA CAAGAACAAC ACCCTGATCA TCGTCACCGA GGACGACACC
CAGAACGGCA ACAACGGCAC CGACCACGTC AGCAACACCT ACCGGGTCCC GCTGGTCGTC
ATCGGCCCGT CGCAGTACGT CAAGCAGGGG TATGTCAGCC ACGTCGCGTA CACGACCAAC
AACGTGATCG CGGCGATGGA ACGGACGATG CAGAACGTCA AGGCCGGCAT CATCGACCCC
AACGACAACA TCGGGCTGAA CACCTTCCCG ATGACCACCA ACGACCAGGC CGCGCTCGGC
GACCCGCTGG AGGACTTCTG GGTCCAGGGC TCGACCCCGC TGTCCGCCTC GGCCACGGCC
TCCCCGACCA CCGGCAACGC ACCGCTCGCG GTGAACTTCA CCGGCTCGGC GACCGGCGGC
ACGGCGCCGT ACAAGTACAG CTGGAACTTC GGTGACGGCG CGACGAGCAC CACGCAGAGC
CCCAGCCACA CCTACAGCTC CGCGGGCAGC TACACCGCGA CGCTGACGGT CACCGACACC
TCCTCCCCGG TCAAGACGGC GACCTCCCAG GTCGCGGTGA ACGTCAGCTC GGTCGGCAGC
CCGCTGGCGG CCTCGGCGGC GGGCACTCCG ACCTCCGGAC AGATCCCCTT GGCCGTGAAC
TTCACGGGAA CGGCGACAGG AGGCACTCCC GCTTACCACT ACAGCTGGAA CTTCGGTGAC
GGCTCCGCGA CGAGCACGGC GCAGAACCCG AGCCACAGCT ACACCGCCGC CGGGACGTAC
ACGGCGACCT TGACGGTGAC CGACAGCGCC TCCCCGGTGA ACACGGCCAC GTCGACGGTC
AGCGTCACGG CGTCGCCGAT CATGGGCACG CCGCCCAGCG CGCCACAGAA CCTCACCGCG
GCAGCGGGTA CCAACCAGGT GACGCTGAAC TGGCAGGCGC CGGCGAGCAG CGGCGGTGAG
AACATCACCA AGTACAGCGT CTACCGCGGC ACGTCCAGCG GCACGGAATC GCTGGTGACC
TCCGGCGGCT GCAGCGGCGT GAGCGGCAGC ACTTTGACGT GCACGGATAC TGGGCTAACC
AGTGGTGTTG ACTACTACTA CCGCGTCACC GCGTCGAACC CCATCGGCGA GGGCGGGCAG
AGCAACGAGG TCACCGCCAC GCCGACCGGC AGCACCGGAT GCACCGCCGG GCAGCTGCTG
GGCAACCCGG GCTTCGAGAA CGGCGCGTCC AACCCGGCGC CGTGGGCGAT CACCTCGACG
CACACCCCGC TCTCGGTGAT CAACAGCAGC AGCTCCGAGC CGCCGCACGG CGGGACCTAC
GACGCGTGGA TCGACGGCTG GGGCAAGGCG ACCACGGACA CGTTGGCCCA GACGGTGACG
CTGCCGTCGG GGTGCACGAC CGAGAAACTC AACTTCTATA TGCACATCGA CACCGCGGAG
ACCACGACCA CCACCAAGTA CGACACGCTC AAGGTGCAGG TCCTCAACCC GGCGGGCACC
GTGCTGGGAA CCCTGTACAC GTACTCGAAT CTGAACGCCA ACACCGGATA TTCGCTGCAC
TCGCTCAGCC TGGCGTCCTA CGCCGGGCAG ACGGTGACGC TGAAGTTCAC CGGCGTTGAG
GACGCCGAGT ACCAGACCTC GTTCGTCGTC GACGACGCCA CGGTCAATGT GAGCTGA
 
Protein sequence
MIRRKRWIAG AAFLALIAGG SIGAAQGLQS TRPSSGPGSC TLIGWNPSTD PTNAKNLPVG 
QRPQSYRPDN FDCSSAKFAA LGSEFTKFPQ PHDFSVNNRI TYEPAAAGKP ATAIQQPVAA
VNPLTPYFPP FQHFVIVYRE NHTFDDYLGD CATTISASCN GKVQSTNHIS SVPDLHSLAK
TYALSDSYST GTQPPSGPNH WWLFSAQSQS SSQQQTYPST GTEFDRFLSN TNGPTNEGTN
ACTAPTGNGS GSSPYTFVMN GDFYWMLSSG SGYWKNPADG KTEVLPPNRP GTSIPEELHY
NEYTCSNQSL PDSTIANDYI NFVNTNGLPV YSYVELFNDH PGTYQDIPGN DTATNNVVNA
IENNATYKNN TLIIVTEDDT QNGNNGTDHV SNTYRVPLVV IGPSQYVKQG YVSHVAYTTN
NVIAAMERTM QNVKAGIIDP NDNIGLNTFP MTTNDQAALG DPLEDFWVQG STPLSASATA
SPTTGNAPLA VNFTGSATGG TAPYKYSWNF GDGATSTTQS PSHTYSSAGS YTATLTVTDT
SSPVKTATSQ VAVNVSSVGS PLAASAAGTP TSGQIPLAVN FTGTATGGTP AYHYSWNFGD
GSATSTAQNP SHSYTAAGTY TATLTVTDSA SPVNTATSTV SVTASPIMGT PPSAPQNLTA
AAGTNQVTLN WQAPASSGGE NITKYSVYRG TSSGTESLVT SGGCSGVSGS TLTCTDTGLT
SGVDYYYRVT ASNPIGEGGQ SNEVTATPTG STGCTAGQLL GNPGFENGAS NPAPWAITST
HTPLSVINSS SSEPPHGGTY DAWIDGWGKA TTDTLAQTVT LPSGCTTEKL NFYMHIDTAE
TTTTTKYDTL KVQVLNPAGT VLGTLYTYSN LNANTGYSLH SLSLASYAGQ TVTLKFTGVE
DAEYQTSFVV DDATVNVS