Gene Francci3_1000 details


Gene Information

Locus tag: Francci3_1000
Symbol:
ID: 3906686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1189981
End bp: 1191759
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 64%
IMG OID: 637878333
Product: 3-hydroxyacyl-CoA dehydrogenase
Protein accession: YP_480112
Protein GI: 86739712
COG category: [I] Lipid transport and metabolism
COG ID: [COG1250] 3-hydroxyacyl-CoA dehydrogenase
TIGRFAM ID:
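
The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene length includes one terminal stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(1189981, 1191759)
print(gene_len, protein_len)  # 1779 592
```

Both results match the "Gene Length" and "Protein Length" fields of the record.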


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATCC AGCTCAACGG CACGAAGGTC GGAGTTGTCG GGCTCGGCAC GATGGGCGCC 
GGCATCGCCG AGGTAATGGC GCGCGCCGGC ATCGAGGTGG TCGGCGTCGA GCTCAACGAC
GAGACGCTGG CCCGTGGGCT CGATCGGATC CGGCACTCCA CGGACCGGGC GATGAGCCGC
GGCAAGTTGA CCAAGGTCGA GCGGGACGCC CTGCTCGCCC GTATCCAGGC CGGAACCGGG
ATCGAGGCCG TGGCCGACTG CCAGCTGGTC ATCGAGGCGA TTCCCGAGCG GATCGAGAAG
AAGCTCGGGC TCTTCGCCGA GCTGGACAGG CTGTGTCCGC CAGAGACGAT CTTTACGACG
AACACCAGCT CCCTGCCAAT CATCACGTTG GCCGTCGCTA CCTCACGCCC CTCCCGGGTT
GTCGGGACCC ACTGGTCCAA CCCCGCACCG GTCATGGGCC TGGTCGAGAT CATCCATACT
GCGGTCACCG ACCCGTCTGT ACTGGAGGAT GTCGGGACGC TCGTCGCGAA GGTCGGAAAG
ACCGCGGTGG TCGCCGGGGA CCGGGCGGGT TTCATCGTGA CCGCCCTGCT GTTCGGGTAC
CTTAACAGCG CTGTCCGGAT GCTGGAGGCG TGCTACGCCA CCCGTGAGGA CATCGACGCC
GCCATGCGGT TCGGCTGCGG TCACCCGATG GGCCCATTGG CACTGCTCGA TCTGATCGGT
CTCGACTCCG CGTACGAGAT TCTCGACTCG ATCTATCACA CCTCCCGCGA CCACCTGCAC
GCACCGGCCC CGCTGCTCAA GCAGCTAGTG ACCGCCGGCA TGCTCGGCCG CAAGACCGGG
CGGGGTTTCT ACACCTATGC CGCCCCTTAC TCCTCCGAGA TCGTCGACAT GGTCGAACCA
CCCCAACTCG GCTTCTTCGC GGTCCCAGGA CGGCCGGTGC ACACGATCGG CGTGGTCGGC
ACCGGCACCA TGGCCAGCGG AATCATCGAG GTCTGTGCCC ACCATGGCTA CAAGGTGGTG
TTCCGTGCAC GGAGCGAGAA GAAAATCGCC GCTGTCCGGA CGAAGATCGA GCGGTCACTG
GACAAAGCGG TCGAACGGGG AAAGATCTCA TCGGACGAGC GTACTTCGAC ACTGGCACGG
GTTCGAGCTT CGACCGATCT GTCCGTCCTC GCCGAGTGTG ACCTCATCAT CGAAGCGGTC
GTTGAGGACT TGGACGTCAA ACGGGCGCTG TTCGCTGAAC TCGACACGGT TGCACGCCCG
GGAGCGGTCC TCGCGACGAC CACGTCCTCG CTGCCAGTGA TCGAGTGTGC CACCGCCACG
TCCCGCCCGC AGGATGTGGT CGGTATGCAC TGGTTCAACC CAGCGAAGAA GATGCGCCTG
GTCGAGATCG TGCCGACCAT CGTAACCGCG GACGACGTGA CCGCGACTGT CTTCGACGTG
GCCAGGACGG CGGGCAAGTA CCCGGTCCGG TGTGCCGACC GCGCCGGCTT CATCGTCAAC
ACCCTACTGT TCCCATATCT CAATGACGCG GTGAAGATGC TGGAGTCCCA TTACGTGGAT
ATCGATGTGA TCGATACAGC CATGAAGGTC GCCTGCGCAC ACCCGATGGG GCCGTTCGAA
CTCGCCGATG TCATCGGCCT GGACGTGACA CTCGCCATCC AGCGCGCCCT GTACCGGGAG
TTCCGCGAAC CCGGGTACAC CCCGACGCTC CTGCTGGAAG ACCTCGTCAG GTCCGGATGT
CTGGGGTACA AGACCGGGCG GGGTTTCCGG GTCTACTGA
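
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as plain text with spaces and line breaks between the 10-base groups; `gc_percent` is a hypothetical helper name. For brevity the example runs on the first 60 bases only, so its result need not equal the whole-gene figure in the record.

```python
def gc_percent(seq):
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First line of the gene sequence above (60 bases).
first_line = "ATGGATATCC AGCTCAACGG CACGAAGGTC GGAGTTGTCG GGCTCGGCAC GATGGGCGCC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1779-base sequence to the same function gives the whole-gene value.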
 
Protein sequence
MDIQLNGTKV GVVGLGTMGA GIAEVMARAG IEVVGVELND ETLARGLDRI RHSTDRAMSR 
GKLTKVERDA LLARIQAGTG IEAVADCQLV IEAIPERIEK KLGLFAELDR LCPPETIFTT
NTSSLPIITL AVATSRPSRV VGTHWSNPAP VMGLVEIIHT AVTDPSVLED VGTLVAKVGK
TAVVAGDRAG FIVTALLFGY LNSAVRMLEA CYATREDIDA AMRFGCGHPM GPLALLDLIG
LDSAYEILDS IYHTSRDHLH APAPLLKQLV TAGMLGRKTG RGFYTYAAPY SSEIVDMVEP
PQLGFFAVPG RPVHTIGVVG TGTMASGIIE VCAHHGYKVV FRARSEKKIA AVRTKIERSL
DKAVERGKIS SDERTSTLAR VRASTDLSVL AECDLIIEAV VEDLDVKRAL FAELDTVARP
GAVLATTTSS LPVIECATAT SRPQDVVGMH WFNPAKKMRL VEIVPTIVTA DDVTATVFDV
ARTAGKYPVR CADRAGFIVN TLLFPYLNDA VKMLESHYVD IDVIDTAMKV ACAHPMGPFE
LADVIGLDVT LAIQRALYRE FREPGYTPTL LLEDLVRSGC LGYKTGRGFR VY
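
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses only the Python standard library and assumes that translation table 11 assigns internal codons the same amino acids as the standard genetic code. It does not handle the alternative start codons that table 11 allows, which is harmless here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the matching amino acid string
# for the standard code ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace between base groups is removed before translation.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGGATATCC AGCTCAACGG CACGAAGGTC GGAGTTGTCG GGCTCGGCAC GATGGGCGCC"))
# MDIQLNGTKVGVVGLGTMGA
print(translate("CTGGGGTACA AGACCGGGCG GGGTTTCCGG GTCTACTGA"))
# LGYKTGRGFRVY
```

The two outputs match the first 20 and the last 12 residues of the protein sequence listed above.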