Gene Francci3_2880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2880 
Symbol 
ID3906011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3396520 
End bp3397491 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content68% 
IMG OID637880201 
Producthypothetical protein 
Protein accessionYP_481967 
Protein GI86741567 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.149752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.426146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGTC CGCCTTCGCC GCATGACGCG GTGTTCCGCC GGGTCCTCGG TGTGCCGTCG 
AACGCGGCGT CGCAGCTGCG CGCGACGCTG CCGGCGGCTC TTGTGGCCCG CCTCGACCTC
GACCGGCTGG CGATTGTCCC CGGTAGCCTG GTCGACGCCA CGCTGCGGTG GCGGCACACC
GACCTGCTCT TCACCGCTCC GCTTGACGGC CATGAGGCGT TCATCTACGT CCTCGTCGAG
CACCAGAGCA GCAGCGACCC GCTTATGGCG TTCCGGATGC TGCGTTACGT CGTGCGTGTT
TGGGACCGCT ACCTGGCAGA CCATCACAAA GCGGCCCGGC TGCCAGCGGT GGTCCCGCTG
GTCGTGCACC ACAACGAGCA CGCGTGGGTC GCCCCGACTC AGGTGCTCGA CCTGGTCGAC
CTGGCTCCGG ACCTCGCCGG CGCCTGGCGG GAGCATCTGC CCCGGTTCCA GTTCCTGCTC
GATGACCTGG TTCGCGTTGA CGAGCGGGAG CTGCGGGAGC GTCCGCTGAC GCACTCGGTG
CGGCTCACTC TGCTCCTCCT CAAGATCGTC CCTGGTAACC CCCGGCTCGC CCAGGACCTG
CGACCGTGGG TCGACGAACT GCGCGCGGTC CTCGACGGCC CGGATGGCAG GGAGGAGTTC
GCCACTTTGC TGCGTTACAT TGAGCTGGTC GGAGAAGCGG ACGCCCGCGA CGAGTTGCAT
GACCTGATCG CCGGCCTTGG ACCTGAGGCG GAGGATGCCT ACATGACCAT CGCAGAGATG
CTCCGTGCCG AGGGTCGTGT CGAGGGTCGC GTCGAGGGTC GTGTCGAGTC GCTCCTCCAG
CTGTTGACCC TCAAGTTCGG TCCGCTTCCC GAGGCCGCGC TCGCCGCCGT GCACGACGCC
TCCGCCGGCC AGCTCCAGAC CTGGACCGCT CGCGTCCTGA TCGCCGACAC GCTCGACCAG
CTTTTCCTCT GA
 
Protein sequence
MSSPPSPHDA VFRRVLGVPS NAASQLRATL PAALVARLDL DRLAIVPGSL VDATLRWRHT 
DLLFTAPLDG HEAFIYVLVE HQSSSDPLMA FRMLRYVVRV WDRYLADHHK AARLPAVVPL
VVHHNEHAWV APTQVLDLVD LAPDLAGAWR EHLPRFQFLL DDLVRVDERE LRERPLTHSV
RLTLLLLKIV PGNPRLAQDL RPWVDELRAV LDGPDGREEF ATLLRYIELV GEADARDELH
DLIAGLGPEA EDAYMTIAEM LRAEGRVEGR VEGRVESLLQ LLTLKFGPLP EAALAAVHDA
SAGQLQTWTA RVLIADTLDQ LFL