Gene Francci3_1016 details

Gene Information

Locus tag: Francci3_1016
Symbol:
ID: 3906258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1210973
End bp: 1212196
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 73%
IMG OID: 637878349
Product: transposase IS116/IS110/IS902
Protein accession: YP_480128
Protein GI: 86739728
COG category:
COG ID:
TIGRFAM ID:
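
The coordinate and length fields in this record are mutually consistent, and the relationship can be checked with a few lines of Python. This is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names span_length and expected_protein_length are hypothetical helpers introduced here for illustration.

```python
# Values copied from the record (Start bp, End bp, Gene Length, Protein Length).
start_bp = 1210973
end_bp = 1212196
gene_length_bp = 1224
protein_length_aa = 407


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions, 1212196 - 1210973 + 1 gives 1224 bp, and 1224 / 3 - 1 gives 407 aa, matching the listed values.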


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTTCG TCGGTGACGA CTGGGCGCAG GACCACCACG ATGTGGAGGT GCAGGACGAG 
ACGGGTCGGC GACTGGCGAA GGGCCGGCTG CCGGAGGGCG TGGCCGGGAT CGCCCGGCTG
CACGCGCTGA TCGGGCGGCA CCTGGCCGAG GACGCCGGCC CGGAGCAGGT CGTGGTCGGG
ATCGAGACCG ACCGCGGCCC GTGGGTGCGG GCGCTCGTCG CGGCGGGCTA CCAGGTGATC
GCGGTGAACC CGTTGCAGGC GGCGCGGTAC CGGGAGCGGT ACTCGACGTC GGGTGCCAAG
AGCGACGCCG GCGACGCGCA CAGCCTGGCG GACATGGTCC GTACGGACCG TCACCAGCTG
CGGCCGGTCG CTGGGGACAG TGACACCGCC GAGGCGGTGA AGATCGTGGC GCGGGCGCAT
CAGAACCTGA TCTGGGACCG GACCCGCCAG ACCCAGCGGC TGCGCTCGGC GCTCCTGGAG
TTCTTCCCGG CCGCGCTGGC CGCGTTCGAC GACCTCGATA CCCCTGACGC GCTGGAGCTT
CTCGCGAAGG CGCCGTCGCC GGCCGAGGCC GCGAGGCTGA CCGTTGCGCA GATCAGCGCC
GCGCTCAGGC ACGCCCGCCG GCGGAAGATC CCCGAGAGGG CGGCCGCGAT CCGGGCGGCG
CTGCGGGCCG AGCAGCTGCC CGTCACGCCG GCGGCGACCA CCGCCTACGC CGCGGTCGTG
CGCGCCCAGG CCGGGCTGCT CGCAGCCCTC AACGGCGAGA TCGCCCGGCT CGAGGAGCAG
GTCGCGGACC ATTTTGACCA GCACCCGGAC GCGAAGATCC TGCTGTCCCA GCCCGGCCTG
GGACCGGTCC TCGCGGCCCG GGTGCTCGCC GAGTTCGGTG ACGACCCGAC GCGCTACGCC
GACGCGAAGG CACGGAAGAA CTACGCCGGC ACGAGCCCGA TCACCCGCGC CTCCGGGAAG
AAGAAGACGG TCCTGGCCCG CTACGCACGC AACAACCGGC TCGCCGACGC GCTACATCAG
CAGGCGCTCT CGGCCCTGAG CGCATCCCCG GGCGCCCGGT CGTACTACGA CGCGATCCGC
GCGCGCGGCA CGTCGCACCA CGCCGCGCTG CGCCAGCTCG GCAACCGGCT CGTCGGAATC
CTGCACGGCT GCCTCAAGAC CCACACCCCC TACAGTGAGG CAACCGCATG GACACAGAAA
GCAACACTCG ACGTCGCCGC TTGA
 
Protein sequence
MLFVGDDWAQ DHHDVEVQDE TGRRLAKGRL PEGVAGIARL HALIGRHLAE DAGPEQVVVG 
IETDRGPWVR ALVAAGYQVI AVNPLQAARY RERYSTSGAK SDAGDAHSLA DMVRTDRHQL
RPVAGDSDTA EAVKIVARAH QNLIWDRTRQ TQRLRSALLE FFPAALAAFD DLDTPDALEL
LAKAPSPAEA ARLTVAQISA ALRHARRRKI PERAAAIRAA LRAEQLPVTP AATTAYAAVV
RAQAGLLAAL NGEIARLEEQ VADHFDQHPD AKILLSQPGL GPVLAARVLA EFGDDPTRYA
DAKARKNYAG TSPITRASGK KKTVLARYAR NNRLADALHQ QALSALSASP GARSYYDAIR
ARGTSHHAAL RQLGNRLVGI LHGCLKTHTP YSEATAWTQK ATLDVAA
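
The sequences above are displayed in space-separated groups of 10 characters, so they need to be joined before any computation such as checking the listed GC content. The following is a minimal sketch under that assumption; clean_sequence and gc_fraction are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the grouping spaces and line breaks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = (
    "ATGCTGTTCG TCGGTGACGA CTGGGCGCAG GACCACCACG ATGTGGAGGT GCAGGACGAG"
)

print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Passing the complete gene sequence to gc_fraction gives the value to compare against the GC content field, and len(clean_sequence(...)) on the full gene and protein blocks gives values to compare against the Gene Length and Protein Length fields.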