Gene Francci3_3014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3014 
Symbol 
ID3904367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3579803 
End bp3581032 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content70% 
IMG OID637880334 
Productprolipoprotein diacylglyceryl transferase 
Protein accessionYP_482100 
Protein GI86741700 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0682] Prolipoprotein diacylglyceryltransferase 
TIGRFAM ID[TIGR00544] prolipoprotein diacylglyceryl transferase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0478204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTCTCG CCGCTATTCC TAGCCCATCG CGTGGCGTGG TGCATCTCGG GCCGGTGCCG 
CTGCGTGCCT ACGCCCTGAT GATCATTATT GGCGTCTTCG TCGCCGTATT CGTGACGGGA
AGAAGGCTGC GCGCTCGCGG CATGGATCCC ATGGTGGCCA GCGAGGTCGC CTACTGGGCC
GTCCCGTTCG GCATCGTCGG CGCCCGCGTC TACCACGTGG TCAGCACTCC GGCCGCCTAT
TTCGGCCGGG ACGGCAACGT GCTGGACGTC ATCAAGATCT GGAACGGCGG GCTGGGTATC
TGGGGCGCCA TCGCGGGCGG GGCCTTCGGC GCGTGGCTGG CGACCCGGCG CTACGGCATC
AGCCTCGCCC TGTTCGGCGA CGCCGCGGCG CCGGGCATCA TCCTGGCCCA GGCGATCGGA
CGCTGGGGCA ACTGGTTCAA CCAGGAGCTG TACGGCAAGG CGAGCACCCT GCCCTGGGCG
GTGCGCATCG ACGAGAAGCA CCAGATCATC CCCGGCGTGT CCACCTATCA GCCGACCTTC
CTCTACGAGT GCCTGTGGAA CCTGGTGGTG GCCGGGATCC TGTTGGTCGT CGATCGGCGG
CACCGGCTCG GCCGCGGCAA GCTGTTCTGC CTCTACGTCG CGCTCTACAC GTTCGGCCGG
TTGTGGATCG AGATGCTGCG CATCGACACG GCGAACCAGA TCCTCGGGCT GCGGGTCAAC
ATCTGGACCT CGATCGTCGT CTGTCTGGGG GCGTTGCTGG CGCTGGCGGT CACCCGCAGT
CCCGTGGATC CGAATCTGTC CAGGGAGGAG CAGGAGGCCC TCGGAATCGC CCGTTCCCGG
CCCGCGGCGC GGTCCACGGT GACGACCGCC GGTACCGCCG ACCAGCGGGC GGCCGCTCCC
GATTCGGCCG GTCCCGATTC GGCCGCTCTC GATTCGGTCG GTCCCGATTC GGTCGATCCT
GATCTGGGCG GTCCCGATCC GGCCGATCCT GGTTCCGCCG GGTCGGTGCC CGCCGCCGCG
GTGCCCGATG CCTCCGGGTC GACCGCCACC ACTGCCACTA CCGCCACCAC CGCCACCACC
GCCACTACCG CCACCACTGC CACCACTGCC ACCACTGCCA CCACCGGCGT ACCGGCTGGT
TCGCAGCAGA GCCGCGGCCT GGCGACGAGA TTGCCGGCGA GCGGTGGGCA CACGTCGGCC
GTTCCGCCGG AGGAGCCGCA GCTGCCCTGA
 
Protein sequence
MVLAAIPSPS RGVVHLGPVP LRAYALMIII GVFVAVFVTG RRLRARGMDP MVASEVAYWA 
VPFGIVGARV YHVVSTPAAY FGRDGNVLDV IKIWNGGLGI WGAIAGGAFG AWLATRRYGI
SLALFGDAAA PGIILAQAIG RWGNWFNQEL YGKASTLPWA VRIDEKHQII PGVSTYQPTF
LYECLWNLVV AGILLVVDRR HRLGRGKLFC LYVALYTFGR LWIEMLRIDT ANQILGLRVN
IWTSIVVCLG ALLALAVTRS PVDPNLSREE QEALGIARSR PAARSTVTTA GTADQRAAAP
DSAGPDSAAL DSVGPDSVDP DLGGPDPADP GSAGSVPAAA VPDASGSTAT TATTATTATT
ATTATTATTA TTATTGVPAG SQQSRGLATR LPASGGHTSA VPPEEPQLP