Gene Francci3_3872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3872 
Symbol 
ID3906640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4634762 
End bp4636243 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content73% 
IMG OID637881198 
Producthypothetical protein 
Protein accessionYP_482951 
Protein GI86742551 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACACGA CCAGCGCGCT GCTGGCCCTG ATCGGGCTGG CTCTCGGCGC CATGGGCGGA 
TTCGCCGCCG CGCGCCGCTT CGCCGAGCCG CGGGTTGCCG CGCTCACGGC TGCGCACGCC
ACCGCCGTGC GGGAGCGCGA CCAGGCCCGG GAGACCGCCG CCGCCGCTGC CGAGCGTGCG
ATGCTGGCCG AGTCGGACGT GTCCAGCCTG CGCACCGCGC TGGAGTACGA GCAGCGAGCG
GCGGGCGAGC GGGTGGCGCT GGTCGAGCAG AGCCAGGACC GGCTTGCCGA GAGCTTCCGA
GCGCTGTCCG CGCAGGCTCT GGAGGGCGCG AGCCGTCAGC TGGTAGAGCT GGCCTCCGCC
CGGCTGGACG AGGCCGGCGC CCGCGCCCGC GGCGACCTCG ACGCCCGTCG TTCGGCGGTC
GAGAGCATGG TGACGCCGCT GCGGGAGGCT CTCGGGCGGA TGGAGGACCG GCTGCGGGAG
CTGGAGACCG CGCGCACTGA GGCCTACGCG GCACTCGTCG AGCAGGTGCG GTTCGCCCGC
GAGGCGTCGG AGAACCTCCG GTCGCAGACC GCCGCGCTCG TGACCGTGCT GCGCCGGCCC
CAGGCCCGCG GTGCCTGGGG CGAGATGCAG CTGCGCCGGG TGGCGGAGGT GGCCGGCATG
CTCAACCGCT GCGACTTCAC CGAGCAGATG ACGATCCAGG GCGACGACGG CCCGCAACGG
CCCGACATGG TCGTCCACCT TGCCGGTGGC CGCAACGTGG TCGTCGACGC GAAGGTTCCG
CTCAGCGCGT TCCTGGAGGC CGCCGACACG ACGGACGAGG AGCATCGCGC GCGCCGGATG
GCCGCCCATG CCCGCCATCT GCGTGCGCAT GTCGACGGCC TCGGCGCCAA GTCCTACTGG
CGGCGGCTGC CGTCATCCCC GGAGTTCGTG GTGCTCTTCG TCCCCGCCGA GGCGTTCCTG
GCCCCTGCCC TCGATCACGA TCCCGGCCTG CTCGAACACG CCGCAGGCAA GAAGGTCATC
ATCGCCACCC CAACCACGCT GATCGCTATG CTGCGGACCA TCGCCCACGC TTGGACTCAG
GATGCGCTGA CCGCACGGAC GAAGGAGATC TTCGAGCTGG GTCGCGACCT CTACACCCGC
CTCGGCACCC TGGGCGAACA CGTCGATCGC CTCGGTCGCT CGCTCGGCCG GGCGGTGGGG
GACTTCAACG CCACCGTCGG CTCGTTGGAA AGCCGGGTGC TGACCCCCGC CCGCCGGCTC
GCGGCCATGG AAGTCGTCGA GGCGGGGCTC CCCAGTCCGG TTCCGGTAGA GACCGGCGTG
CGGCCGCTGT CCGCCGCCGA GCTCCTGAGG AGCACCGGGG AGGGCGCGAC GACCGGGCGG
GGTGGCGCCA TGGAGGACCC CGGAATCGAT GTCGGGTACC AGACCCCTGA CGGCGCAGAT
CCGGCCAGAT GGGACGCGAA CGACCAGCAT AAGGAGGATT AA
 
Protein sequence
MDTTSALLAL IGLALGAMGG FAAARRFAEP RVAALTAAHA TAVRERDQAR ETAAAAAERA 
MLAESDVSSL RTALEYEQRA AGERVALVEQ SQDRLAESFR ALSAQALEGA SRQLVELASA
RLDEAGARAR GDLDARRSAV ESMVTPLREA LGRMEDRLRE LETARTEAYA ALVEQVRFAR
EASENLRSQT AALVTVLRRP QARGAWGEMQ LRRVAEVAGM LNRCDFTEQM TIQGDDGPQR
PDMVVHLAGG RNVVVDAKVP LSAFLEAADT TDEEHRARRM AAHARHLRAH VDGLGAKSYW
RRLPSSPEFV VLFVPAEAFL APALDHDPGL LEHAAGKKVI IATPTTLIAM LRTIAHAWTQ
DALTARTKEI FELGRDLYTR LGTLGEHVDR LGRSLGRAVG DFNATVGSLE SRVLTPARRL
AAMEVVEAGL PSPVPVETGV RPLSAAELLR STGEGATTGR GGAMEDPGID VGYQTPDGAD
PARWDANDQH KED