Gene Francci3_2429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2429 
Symbol 
ID3905041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2822237 
End bp2823496 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content71% 
IMG OID637879759 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_481525 
Protein GI86741125 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0694063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.318501 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAGC ACCACATGAC CGAGCACCAG CGGCACCGCA CTGCGCCGGG CGCCGCCCCG 
GACTCCCCTT CGCCGCACGT CCCGCTGCCG TTGGCCGGGG TGACCGTGGT CAGCCTGGAG
CAGGCCGTCG CGGCCCCGTT CGCCACCCGC CAGCTGGCTG ATTTCGGCGC GCGGGTCATC
AAGATCGAAC GGCCCGACGG CGGCGACTTC GCCCGCCGCT ACGACGTGTC GGTGCACGGC
CAGTCGAGCT ACTTCGTGTG GCTCAACCGC TCCAAGGAAT CCCTCACCCT GGACGTGAAA
ACCGCCGAGG GGCGCGCGGT TCTGGAGGAG CTGCTGGCCA GGGCCGACGT GCTGGTGCAG
AACCTCGGCC CCGGCGCCGC CGCTCGCCTC GGCCTCGACG CCTCGTCCCT GGCCGACCGG
CACCCTCGGG TGATCCCCTG CACGATCTCC GGCTACGGTA CCGACGGGCC CTGGGCCGAC
CGGAAAGCCT ACGACCTGCT CGTGCAGTGC GAGACCGGCC TCGTGTCGCT GACCGGTAGC
CCGGACCAGA TGGCGAAGGC CGGGATCTCG GTCGCCGACA TCGCCGCCGG CATGTACGCC
TACACCGGCA TCCTCACCGC CCTCTACCGG CGCGCCACCA CGGGGTCGGT CTCCGCGGTC
GAGGTATCGT TGTTCGAGGC CCTCGCCGAG TGGATGGGTT CACCGGCCTA CTACACCCGG
TACGGCGGGC GGCAACCGGC ACGGGTCGGC GCTCAGCACG CCACCATCGC CCCCTACGGC
CCGTTCACCA CCGCCGAGGA CCAGACGGTG CTGCTGGCCA TCCAGAACGA ACGCGAATGG
CACGCCTTCT GCCGGATCGT GCTCGACGAC CCGACCCTGA CCGAGGACCA CCGCTTCGCG
ACCAACTCCG CACGCGTCGC CCACCGCGAC GCCCTCAACG GGGTGATCGC GGACCGATTC
GCCGCGCTCG ATACCGGCAC AGTGCTGGCG TTGCTGGCCA AAGCCGGTAT CGCCAACGCC
CGACTGAACT CCGTCGCCCA CTTCCTCGAC CATCCCGTCC TGACTGGCCG CGACCGGTGG
CGCACCGTTG CCACCCCCGG CGGCGACATC GGTGCCCTCC TGCCTCCGGT CACCCTCACC
GACCTGGACC CGGTGATGAA CCCCGTTCCC GCCCTCGGCG AACACACCGA CACCATCCTG
CGCAGCCTGG GCCGCACCGA TACCGCCATC GCCGCGCTCC GTGCCGACGG CGTCATCTGA
 
Protein sequence
MTEHHMTEHQ RHRTAPGAAP DSPSPHVPLP LAGVTVVSLE QAVAAPFATR QLADFGARVI 
KIERPDGGDF ARRYDVSVHG QSSYFVWLNR SKESLTLDVK TAEGRAVLEE LLARADVLVQ
NLGPGAAARL GLDASSLADR HPRVIPCTIS GYGTDGPWAD RKAYDLLVQC ETGLVSLTGS
PDQMAKAGIS VADIAAGMYA YTGILTALYR RATTGSVSAV EVSLFEALAE WMGSPAYYTR
YGGRQPARVG AQHATIAPYG PFTTAEDQTV LLAIQNEREW HAFCRIVLDD PTLTEDHRFA
TNSARVAHRD ALNGVIADRF AALDTGTVLA LLAKAGIANA RLNSVAHFLD HPVLTGRDRW
RTVATPGGDI GALLPPVTLT DLDPVMNPVP ALGEHTDTIL RSLGRTDTAI AALRADGVI