Gene Cphamn1_1398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1398 
Symbol 
ID6375076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1513158 
End bp1514852 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content52% 
IMG OID642683893 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001959807 
Protein GI189500337 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000129983 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCTCTG CACCATGGAT CGAGCATTAT GACAAGGGAG TTCCTGCTTC TCTGTCGCCT 
TATCCACATC ATACCATAGT CGATATTGTC CGAGACCGGG CGGCTGATTA TCCCGACAGG
ACAGCGTTTT TTTTCAAGGG ATCTTCCATG TCATGGTCCG AGCTGGACAG GCTGAGCAAT
GCGCTCTCGG CCGCGCTGGT TTCAGAAGGG CTGAAGAAGA CAGACCGTGT GGCGCTGTTG
ATGCCGAATT CTCCTCAGAT GATACTCAGT GAACTGGCTA TCTGGAAGGC GGGAGCTGTA
GCGGTACCGA TGAACCCTCT CTATACCGGT CATGAGCTGG AGCATGCAAT GAAGGAGTGC
GGGGCGGAGA CAGCTATAGT TCTTACCCCG TTTTACCGGA AGATCAGATC GTTGCAGGCT
TCTATTGGTC TGCGAAAGAT CATAGCGACC AATATCAAGG AATATCTCTC GCCGTTAAAA
AAAGTGCTTT TTTCTCTTCT GAAGGAGAAG AAAGATGGCC ATGCCATAGT GCTTGAACCG
GGTGATCTCT GGCTTGGAGA GATGATTGCC GGACATCGTG ACGCACCTTG TCCTGAAAGA
GGCGTAACGC CGGAAGACAT GGCTTTATTT CTGTTCACCG GCGGTACTAC CGGACTGCCC
AAATGCGCGG TATGTACCCA TAAGGCTCTG GTGGTCAGCG GGATGCAGAT CGCACGGTGG
TTCAGTGTCG TGCTTGACCG GGGGGAGGAT ATCATAATGC TCAATATGCC CCTGTTTCAT
GTTTATGCTC AGGTGGGGAT ACTTGGAGCA GCTATTGTCG ATGATTATCC TTTCGCGCTT
GTTCCGAACC CTCGTGATCT CGATGATCTG CTGGTTACCA TCAAAAAGCT TAAACCGGCA
GTTCTTCCGG GTGTTCCGAC ACTCTTTTCA GGTTTGATCA ATCATCCAAG GACACGCAAA
GACAGCACTG TTCTGGGTTC ACTGAAGCTC TGTGTTTCCG GAGCGGCCCC TTTGCTGCTT
GAAACCAAGA AGCGGTTTGA GGAGCTGACA GGAGGCAGAA TCATAGATGC ATACGCTCTT
ACCGAGTCGA TGATCGGTTC CGTTCTTACC CCCGTGCTGG GAACCTACAA AGAGGGCTCC
GTGGGCATAC CTGCTCCGGA TGTGGAGATT CGTATAGTCG ATCAGGAGAG TGCGTCCCGT
GAATTGCCGT TTCATGAAGT CGGTGAAGTG ATAATGCGCG CTCCCCAGCT CATGAAAGAG
TACTGGAAGC GCCCCGAAGA AACCATGTCG ACTATTCGTG ATGGCTGGTT GTACACCGGT
GACCTTGGTT ACCTCGATGA TGACGGCTAT CTGTTCATCA TCGACAGAAA AAAGGATGTC
ATCAAGCCTG GCGGTTTTCA GGTTTGGCCG CGGGATGTTG AAGAGGTTAT CGCGTCACAT
CCGGACGTAG TAGAAGTCGG CGTTGCGGGA GTTCCGGACG ACTATCAGGG AGAAGCCGTC
AAAGCCTGGG TCGTGCTGCG TGAGGAGTGT GTGCTCGATG CCGAAACGTT GCGTGAGTTC
TGTAAAAAGG AACTGGTGGC CTATAAGGTG CCGAAGTATA TTTCTTTTAC GGAGTCCCTG
CCGAAAACCC TGGTAGGAAA GGTGCTGCGC CGCAAGCTTG TCGAAGAGCA TTGCCAGGCT
GCTGCGAACG GGTGA
 
Protein sequence
MSSAPWIEHY DKGVPASLSP YPHHTIVDIV RDRAADYPDR TAFFFKGSSM SWSELDRLSN 
ALSAALVSEG LKKTDRVALL MPNSPQMILS ELAIWKAGAV AVPMNPLYTG HELEHAMKEC
GAETAIVLTP FYRKIRSLQA SIGLRKIIAT NIKEYLSPLK KVLFSLLKEK KDGHAIVLEP
GDLWLGEMIA GHRDAPCPER GVTPEDMALF LFTGGTTGLP KCAVCTHKAL VVSGMQIARW
FSVVLDRGED IIMLNMPLFH VYAQVGILGA AIVDDYPFAL VPNPRDLDDL LVTIKKLKPA
VLPGVPTLFS GLINHPRTRK DSTVLGSLKL CVSGAAPLLL ETKKRFEELT GGRIIDAYAL
TESMIGSVLT PVLGTYKEGS VGIPAPDVEI RIVDQESASR ELPFHEVGEV IMRAPQLMKE
YWKRPEETMS TIRDGWLYTG DLGYLDDDGY LFIIDRKKDV IKPGGFQVWP RDVEEVIASH
PDVVEVGVAG VPDDYQGEAV KAWVVLREEC VLDAETLREF CKKELVAYKV PKYISFTESL
PKTLVGKVLR RKLVEEHCQA AANG