Gene Franean1_2361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2361 
Symbol 
ID5670757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2805470 
End bp2806654 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content72% 
IMG OID641241278 
Productlipid-transfer protein 
Protein accessionYP_001506699 
Protein GI158314191 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0692288 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG GGTTCATCCG GGACGCCGCG TGCATCGTCG GGATCGGTCA CTCACAGTAC 
GGAACCCGCG GCTCGCTGGC GCCGCTCGGC CTGACCCGCA TCGCCTTCGA CGCCGTCCAC
GACGCCTGCG CCGACGCCGG GCTCGACGCC AGGGACATCG ACGGGTTCGC CGGCTACTGC
GACGACCCGA CCCTGCCCGC CGACCTGGCC GTAGCGCTCG GCACCCGGGA GCTGCGGTAC
GCCGGCATGA CCTGGGGCGG TCGTGGCTCG GGGCTGCCCG GCGCGGTGGC GGGAGCTTAC
GCCGCGGTCG CCACCGGACT GGCCGACCAC GTCGTGGTCG TGCGCTCGAT CATCCAGCAG
GCGCGGCTGG GGCAGTCGGT GGCCGCCGGT GTGCAGCCGG GGCAGGCGAT TCCGCTGTCG
GCGTCCTACA CCTCGCCGTT CGGCATGGCG CTGCCGGCCG CGATCTACGC GATGAAGGCC
CGCCGGCACA TGGCGCTGCA CGGCACGACG ACCGAGCAGT TCGCGCAGGT CGCCATCAAC
GCGCGGCGCA ACGCGGTGAA CAACCCCGAC GCGCGTTTCC GCACGGAGAT CACCGTTGAG
GATCATCACG CCTCCCGGCT GATCTGTGAC CCGCTGCGGC TGCTGGACTG CTGCATGGAG
TCCGACGGCG CCGCCGCCGT GATCATCACG ACGCCCGAGC GTGCCCGGGA CCTGCGCCAG
CCACCCGTGC GCATCCGCGC GGTCGCGGCG ACCGGCGAGT ACAAGTGGGC CACCGCGTCG
TTCAACACCG TCGACGAGGA TTTCGTCAGC ACCGGGCACC GCCGAGCCGC CCGCGATCTC
TACCAACGGG CGGGCCTGGG CCCCGAGGAC GTCGACGTCG CACTGGTCTA CGACGGGTTC
ACGCCGTCGG TGATCATGAG CCTCGAGGAT TTCGGCTTCT GCGGTATCGG CGAGGGCGGC
CCGTTCGTCG AAGGGGGCGC CATCCGGCGG GAGGGCAGCA TTCCCGTCAA CACCCACGGC
GGGAATCTCG CCGAGGTCTA TCTGCAGGGC ATCACCCACC TGCTCGAAGG CGTCCGGCAA
CTGCGCGGGA CGGCCGTCAA CCAGGTGGCC GGCGCCGACG TCGCCCTCTA CGCTTCCGGG
GTCGGCGCCT CGCCGGGCGG CGGGGTGCTG CTCCGCCGCT GGTGA
 
Protein sequence
MSGGFIRDAA CIVGIGHSQY GTRGSLAPLG LTRIAFDAVH DACADAGLDA RDIDGFAGYC 
DDPTLPADLA VALGTRELRY AGMTWGGRGS GLPGAVAGAY AAVATGLADH VVVVRSIIQQ
ARLGQSVAAG VQPGQAIPLS ASYTSPFGMA LPAAIYAMKA RRHMALHGTT TEQFAQVAIN
ARRNAVNNPD ARFRTEITVE DHHASRLICD PLRLLDCCME SDGAAAVIIT TPERARDLRQ
PPVRIRAVAA TGEYKWATAS FNTVDEDFVS TGHRRAARDL YQRAGLGPED VDVALVYDGF
TPSVIMSLED FGFCGIGEGG PFVEGGAIRR EGSIPVNTHG GNLAEVYLQG ITHLLEGVRQ
LRGTAVNQVA GADVALYASG VGASPGGGVL LRRW