Gene Caci_5944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5944 
Symbol 
ID8337306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6861628 
End bp6863226 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content69% 
IMG OID644959048 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003116643 
Protein GI256395079 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.420174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.713256 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCC AGAATGCGTG GCACCCGAGC GGTGACCAGT CCATTGCCCG GCGCGGTACG 
CTTGCCGACT TGTTGCGTCG TAGTGCTGCG CGGGAGCCTG GCAAGCTGGC GCTCGTCTTC
GGTGGGGTGC GGCAGACTTT TGCCGAGCTT GATGTGACCG TCAGTCGGGC CGCGAATGCC
CTTGCTGAGC GGGGTGTGCG GTACGGGGAT CGGGTCCTGT TGCTCGCGCA CAATTCGCAC
GGCTTCGTCG TCGCCTACTT CGCGCTCGCG CGGCTCGGTG CGGTCTCCGT GCCGGTGAAT
TTCATGCTGG GTCCCGATGA GATCGCCTAC GTTCTCACGC ATTCCGGCGC CGTCGCGGTC
ATCGCTGAGG ACGCGCTGGC CGACACCGCT GACCGCGCGT GCCAGGTCGC CGGCATCGTG
CCTGTCGTCA GGGCCGCGAT CAGTAGCGGC GCCACCGAGA CCCCGGAAGG CTGGCTCGAC
TTCGAGACCG CCTACCGAAA CGCCTCCGCC GACGAGCCCG ACGCCCCCGT CACCGACGAC
GACCCCGTCC AGATCATGTA CACCTCCGGC ACGGAGTCAC GTCCCAAGGG CGCCGTCATG
TCCACCCGGA ACCTGATCGC GCAGTACACC AGTGCCATCG TCACCGGCGC CATGTCCGCC
GACGACATCG AGGTCCACGC CCTCCCGCTC TACCACTGCG CGCAGCTTCA CTGCTTCCTC
ACCCCCGACA TCCAGCTCGG CGCCACCAGC ATCGTGCTCC CCGGCGCCGA TCCCGCGACG
ATCCTGCGCA CCGTCGAGCT GGAGCACGTC ACCAAGCTCT TCTGCCCGCC GACGGTCTGG
ATCGCCCTGC TGCGCCATCC CGATTTCGAC GCCCGCGATC TCAGTACCCT GCGCAAGGGC
TACTACGGTG CCGCCGCGAT GCCGGTCGAG GTCCTGGCCG AACTGCGCCG CCGGCTTCCC
GAGCTGCGGC TGTACAACTT CTACGGCCAG ACCGAGATGT CCCCGGTCGC CACCGTGCTC
GGTCCGGAGG ACCAGGAACG CAAGCCCGGC TCGGCCGGCC GTGCCGCGCT CAACGTCGAG
ACCCGCGTGG TCGACGACGC CGGGAACGAG GTCCCGCGCG GCGAGGTCGG CGAGATCGTG
CACCGCGGCC CGCACACGAT GCTCGGCTAC TGGAACGACC CCGAGCGCAC CGCCGAGGCC
TTCCGCGGCG GCTGGTTCCA CAGCGGCGAC CTCGGCGTCA TGGACGAGGA GGGCTACCTC
GCCGTCGTGG ACCGGAAGAA GGACATGATC AAGACCGGCG GGGAGAACGT CGCGAGCCGC
GAGGTCGAGG AGACCGTCTA CCAGCACCCG GCGGTCGCCG AGGTGGCGGT GTTCGGCGTG
CCGGATCCGT ACTGGATCGA GATGGTCTGC GCGGCGGTGG TGGTCAAGCC GGGGGAGCGG
CTGGAGCCGG AGGAGGTCGT CGAGTTCTGC CGGGCGCGGC TGGCGGGGTT CAAGACGCCC
AAGAAGGTGG TCATCGTCCC CGCGCTCCCG AAGAACCCCT CCGGCAAGGT CCTCAAGCGC
GAACTGCGCG AGATCCACGC TGCCTCTGAC AGCGCATGA
 
Protein sequence
MTAQNAWHPS GDQSIARRGT LADLLRRSAA REPGKLALVF GGVRQTFAEL DVTVSRAANA 
LAERGVRYGD RVLLLAHNSH GFVVAYFALA RLGAVSVPVN FMLGPDEIAY VLTHSGAVAV
IAEDALADTA DRACQVAGIV PVVRAAISSG ATETPEGWLD FETAYRNASA DEPDAPVTDD
DPVQIMYTSG TESRPKGAVM STRNLIAQYT SAIVTGAMSA DDIEVHALPL YHCAQLHCFL
TPDIQLGATS IVLPGADPAT ILRTVELEHV TKLFCPPTVW IALLRHPDFD ARDLSTLRKG
YYGAAAMPVE VLAELRRRLP ELRLYNFYGQ TEMSPVATVL GPEDQERKPG SAGRAALNVE
TRVVDDAGNE VPRGEVGEIV HRGPHTMLGY WNDPERTAEA FRGGWFHSGD LGVMDEEGYL
AVVDRKKDMI KTGGENVASR EVEETVYQHP AVAEVAVFGV PDPYWIEMVC AAVVVKPGER
LEPEEVVEFC RARLAGFKTP KKVVIVPALP KNPSGKVLKR ELREIHAASD SA