Gene Francci3_2124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2124 
Symbol 
ID3905514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2494191 
End bp2495429 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content68% 
IMG OID637879459 
Producttransposase IS116/IS110/IS902 
Protein accessionYP_481225 
Protein GI86740825 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.29674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTGG TGATCGACCG GGTCGCCGGG TTGGACGTGC GCCGGGACAC GGTCGTGGCG 
GCGGTCCGGG TTGGCGGGCG TGGTGGCGGC AGGCGGGGGG AGGTGCGGAC CTTCGCCACG
ACGGGAGCGG GACTGACCCG GCTGGCCGGG TGGCTGTCGG AACAGCGGGT TTCTCTGGTG
GGTATGGAAT CCACCGGCGT CTACTGGAAG CCGGTGTTCC ACCTGCTGGA AGACCGGTTC
GAGTGCTGGC TCCTCAACGC CACCCACGTC CGCAACGTAC CAGGCCGAAA AACAGACGTC
GCGGACGCGG CGTGGATCTC GGACCTCGTC GCGCACGGCC TGGTACGCGC CTCGTTCGTG
CCGCCGAAAC CCCAGCGGGA CCTGCGTGAC CTGACCCGGG CCCGGCGGAT CGTGGTCGAG
GAGAAGACCC GGGAGATCCA GCGGCTGGAG AAACTGATGC AGGACGCCGG CGTGAAACTC
ACCAGCGTCG CCTCCAAGCT ACTCGGGGTC TCGGGCCGTG CGATCCTGGA GAAGATGATC
GAGGGAGAGC AGTCCCTGGA ATATCTCGCT GATCAGGCCC GTGGCCGACT CCGCAGCAAG
ATCCCACAGT TGCAGGAGGC ACTCGCGGGA ACGTTCCGCT CCGGGCATCA CGGGTTCCTC
GCCGCGCAGC TCCTGGCCCG GATCGACCTG TGTGACGAGC AGATCGACGA GCTCGACCAC
CGGATCGAGG TGATGATCGC CCCTTTTCGG GAGACGGTCG ACCGGATCCG CACGATCACC
GGGGTCGGTG AGGTCACCGC GACCGTGCTG CTCGCCGAGG TCGGCCTGGA CATGAGCCGG
TTCCCCACCG CCGGCCATCT CGCGTCCTGG GCGGGTATCT GTCCGGGGAA CAACACCTCG
GGAGGGAAAC GCCTGTCCGG GCGGACCCGA CACGGTAACA AGTGGTTACG TACCGCGTTG
ACCGAGGCCG CGCACGCCGC CGCCCGGAGC AAGGACACCT ACCTGGCGTC CCACCACGCC
CAGGTCCGTG GCCGCCGCGG TGTCCTGAAA GCGATCGGCG CGACCCGCCA CGACATTCTC
ATCGCCTACT GGCACATCAT CGCGAACAAG ACCGTCTACC AGGACCTCGG CGGAGACTGG
CATGCCCGCC GACGCCGGGA CCCTGAACGC CGCCGGAAGA ACCTCGTCGG CGAACTGGAG
AAACTCGGCT ACACCGTCAC CATCACACCA GCGGCATAG
 
Protein sequence
MDVVIDRVAG LDVRRDTVVA AVRVGGRGGG RRGEVRTFAT TGAGLTRLAG WLSEQRVSLV 
GMESTGVYWK PVFHLLEDRF ECWLLNATHV RNVPGRKTDV ADAAWISDLV AHGLVRASFV
PPKPQRDLRD LTRARRIVVE EKTREIQRLE KLMQDAGVKL TSVASKLLGV SGRAILEKMI
EGEQSLEYLA DQARGRLRSK IPQLQEALAG TFRSGHHGFL AAQLLARIDL CDEQIDELDH
RIEVMIAPFR ETVDRIRTIT GVGEVTATVL LAEVGLDMSR FPTAGHLASW AGICPGNNTS
GGKRLSGRTR HGNKWLRTAL TEAAHAAARS KDTYLASHHA QVRGRRGVLK AIGATRHDIL
IAYWHIIANK TVYQDLGGDW HARRRRDPER RRKNLVGELE KLGYTVTITP AA