Gene Francci3_2474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2474 
Symbol 
ID3904852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2918242 
End bp2919426 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content70% 
IMG OID637879804 
Productlipid-transfer protein 
Protein accessionYP_481570 
Protein GI86741170 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.093907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGTC GGGTGATGGT GGCGGGCGTC GGCATGGTGC CGTTCGCGAC GCCGAGCAGG 
AGCGACAGCT ACGACGTGCT GGCCGAGGGG GCGGTCAGGG CGGCGTTGGC CGATGCCGGG
ATCGACCTGG CGGCGGTCCA GCAGACGTAC GCCGGCTATG TGTACGGCGA CTCGACGAGC
GGGCAGAAGG CGCTGTACCG GGTGGGGATG ACCGGGGCGC CGGTGGTGAA CGTGAACAAC
AACTGCTCTA GTGGGTCGTC GGCGCTGTTC CTGGCGCGGC AGGCGGTGGC TTCCGGGGCG
GCGGACTGCG TGCTGGCGTT CGGGTTCGAG CAGATGCGGC GCGGCGCGCT GACGATGCAG
TGGGACGACC GGCCCAACGC CTTCGAGCGG TTCGACGAGG TGGTCACCAA GGTGCAGGGC
GAGGTCGAGG GGGTGCCGTT CGCACCGCGG TACTTCGCCG GTGCGGGTGC GGCCTACTGT
GAGAAGTACG GGATGGACCC GGCGGTATTC GCGCGGATCT CGGTGAAGTC GCGCCGGCAC
GCGGCCAACA ACCCGTATGC GGTGTTCACT GATCCGGTGA CCGTGGAGGA GGTTCTTGCC
TCGCCGCGGA TCCTCGGTTG GCTGACCCGG CTGCAGTGCT GCCCGCCCAC CTGTGGCGCG
GCCGCCGCGG TGGTCGTTTC CGAGGACTTC GCGCGCGCCC ACGGCCTGCG CGCCGATGTG
GCGATCACCG CGCAGGCGAT GACCACGGAT ACCCCGTCCT CGTTCGACGG GGACCTGATG
CGCCTGGTCG GTTACGACAT GACCGCGGCC GCCGCGCGGC AGGTGTACGA GGTGGCCGGG
GTGGACCCGC TCGATGTGCG CGTGGTGGAG CTGCACGACT GCTTCACCAC CAATGAGCTG
ATGACCTACG AGGCGCTGGG GCTGACTCCG GAGGGCACGG CGGAGAAGTT CATCGTCGAC
GGCGACAACA CCTACGGCGG CCGGGTGGTG ACCAATCCGT CGGGTGGGCT GCTGTCCAAG
GGGCACCCGC TGGGCGCGAC CGGGTTGGCG CAGTGCGCGG AGCTGGTGTG GCAGCTGCGC
GGCGAGGCCG ACAAGCGCCA AGTCGAGGAC GTGACCGTGG CGCTGCAGCA TAACATCGGC
CTCGGCGGCG CCGCCGTGGT CACCCTCTAC GAGAAGGTAG GCTGA
 
Protein sequence
MSGRVMVAGV GMVPFATPSR SDSYDVLAEG AVRAALADAG IDLAAVQQTY AGYVYGDSTS 
GQKALYRVGM TGAPVVNVNN NCSSGSSALF LARQAVASGA ADCVLAFGFE QMRRGALTMQ
WDDRPNAFER FDEVVTKVQG EVEGVPFAPR YFAGAGAAYC EKYGMDPAVF ARISVKSRRH
AANNPYAVFT DPVTVEEVLA SPRILGWLTR LQCCPPTCGA AAAVVVSEDF ARAHGLRADV
AITAQAMTTD TPSSFDGDLM RLVGYDMTAA AARQVYEVAG VDPLDVRVVE LHDCFTTNEL
MTYEALGLTP EGTAEKFIVD GDNTYGGRVV TNPSGGLLSK GHPLGATGLA QCAELVWQLR
GEADKRQVED VTVALQHNIG LGGAAVVTLY EKVG