Gene Francci3_4088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4088 
Symbol 
ID3907052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4895926 
End bp4897356 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content71% 
IMG OID637881416 
Productputative replication initiation protein 
Protein accessionYP_483165 
Protein GI86742765 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGTGG GCGTGAGCTT GCCCGAAAAC CGGCCGGTCG CGGGTGGCTG TTCCCGGCCG 
ATCCGCCTCT CCGGCCATGT CGATCATGTG GATGTTGGTA CCGGTGAGGT CCGCCGCGCG
TTCACCTCGG CGGGTGAGCC GGGCGGCGTG TTGCATGTGC GGTGCAACAA CCGGCGTGAG
TCGGCGTGCC CGGCGTGCTC GGCGGTCTAC AAGCGGGACG CCTGGCGGCT GGTCCTGGCT
GGGCTCGCGG GCGGCAAGGG CGTGCCGGAG ACGGTGACCG GGCATCCGGC GTGGTTCGTC
ACCCTGACCG CGCCGTCGTT CGGGCCGGTG CATTCCCGCC GCCAGTACGG CGGGAAAACC
GGTCCAGTGC AGGCATGTCA CCCCCGGCGG GGACTGTGCC CGCACGGGAA ACCGGCGGGT
TGTCATGAGC GGCACCGCGA GGATGATTCC CGGCTCGGTT CCCCGATCTG CCCGGACTGC
TACGGCTACG GCCGGTCGGT GGTCTGGAAC GCGCTTGTTC CCCGGTTGTG GAAGGCCACG
CGGGACGCGA CGGAATCGGC GGTGGCGGCG GCGGCCGGTC TGACGGTGGC GGGGCTGCGC
CGTGCGGCGC GGTTGAGCTT CGTCAAGGTC GCGGAAATGC AACAACGCGG GGTCGTGCAT
CTGCATGTGG TGGTCCGGGT GGACGGTCCG GACGGTCCCG GCTCGGCTCC TCCGGCGTGG
GCGGCTGGTG AGCTGGTCGC GGACGCTCTG CGGGGCGTGG TCGGGTCGGT TTCGGTGCCT
GCTCCCGATC CGGACGCGGC CACCCTCGAC GCCGGCGCTG GTGCCGGGGA TGGGTGGGCG
GTGCGCTGGG GTGTGCAGGT CGATATCCGG CGTATCGCGC TGGATGGGCC CACCGACGTC
GGGCGGGTCA GTAACTACCT GGCGAAGTAC ATCACGAAGT CTGCGGCGGC CGGTGGGGCG
TTGGATCATC CGGTGCGGTC GCTGGCCGCA CTCGGCCGGC TGGTCCTGGT TCCGCATGTG
CGCCGGTTGG TGGAGACCTG CTGGCGGCTC GGCCACGACG CCACGTTCAC GGCGGCGTTG
GATGCGGCAC TCGGCCGGGA CTCCGGCGAT GTCCCGCGAC TGGTCCGCTG GTCTCACCAA
ATGGGCTTTG GTGGTCACTG GCTGTCAAAG TCGCGGCGGT ACTCGACCAC GTTTGGTGCG
CTGCGGACGG TGCGGCGAGT CTGGTCGCGC ACGATCGGTG CGGCGATGTC GGGCCGGGTG
CCGGTGGATG CGTTCGGCCG TCCGGACGGC GATCCCGACA CGTTGGCCCT CGGGGCATGG
ACCTACGCGG GGCGTGGTCT ATATGCCGGG GATCACGGTG ATGATCCACC TGACGGGATG
TCGGTGCCGG CGGCTGGATC GGGTTCCGAC GTGTGGCTGG CCGGGCTATG A
 
Protein sequence
MDVGVSLPEN RPVAGGCSRP IRLSGHVDHV DVGTGEVRRA FTSAGEPGGV LHVRCNNRRE 
SACPACSAVY KRDAWRLVLA GLAGGKGVPE TVTGHPAWFV TLTAPSFGPV HSRRQYGGKT
GPVQACHPRR GLCPHGKPAG CHERHREDDS RLGSPICPDC YGYGRSVVWN ALVPRLWKAT
RDATESAVAA AAGLTVAGLR RAARLSFVKV AEMQQRGVVH LHVVVRVDGP DGPGSAPPAW
AAGELVADAL RGVVGSVSVP APDPDAATLD AGAGAGDGWA VRWGVQVDIR RIALDGPTDV
GRVSNYLAKY ITKSAAAGGA LDHPVRSLAA LGRLVLVPHV RRLVETCWRL GHDATFTAAL
DAALGRDSGD VPRLVRWSHQ MGFGGHWLSK SRRYSTTFGA LRTVRRVWSR TIGAAMSGRV
PVDAFGRPDG DPDTLALGAW TYAGRGLYAG DHGDDPPDGM SVPAAGSGSD VWLAGL