Gene Rcas_3939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3939 
Symbol 
ID5541445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5140937 
End bp5146210 
Gene Length5274 bp 
Protein Length1757 aa 
Translation table11 
GC content62% 
IMG OID640896047 
Productalpha-2-macroglobulin domain-containing protein 
Protein accessionYP_001433990 
Protein GI156743861 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTGTGC TGCGACAGAT CGGTGTGATC TGGACCGGCG CTTTGCTGGT CATTGTGGTG 
GCGTCCCTGG CGCTGCCAGG AGCGCGTTTC CTCATCCTCC CCGCGCTCTC CGCCGTTCCT
GCGGTTGTTT CAGTGTCGCC GCCTGATGGC GCGCGCAGTG TGTCGCCACG CGCGGTGCTC
ACTATTCAGT TCAGTGCATC GATGAATCCG CCGAGCGTCG AGCATGCGTT GCGTATCGAT
CCCGCGACGG ATGTCGTCTA TGGTTGGGAT TTGGATCGCA CAACACTGAC GATCACGCCA
ACGACTGCGC TTCAGGCTGG CGTGCGTTAC CACATCACGG TTGCGGAAAC GGCGCTGAGT
CGGTATTTTC GCCCGCTGGC GCAACCGTTC ACGTTCGCCT TCGAGACAGC GCCGCCTGCT
GCCATCACTG CCCTGTTGCC CCGCGATGGC AGTGTTGACG TCGCACTCGA TACGCTGGTC
GGTGTGCGGT TTAGCCGCCC GATTGTTTCG CTCGACGCCC TTGCCCGCTC TGCGACGCTG
CCTGCTCTCC GGAGCGACCC GCCGCTGGCG GGAAGCGTCA CCTGGCTCGA CCCGGCGACG
TTGCTGTTCC GTCCATCCGA GCCGCTGCGC CCTGGCGTGC GCTATACCTT CTCCCTCGAT
CCAGACCTGA CCGATCAGAG TGGTGTTCCG CTTGGACGAG CCTATACCTG GTCGTTTACA
ACGCTGGCGC CGATGGTGCG TGAGGTGTCG CCGCCGCCGA ATGCGCGCCT GGTGGGACCG
TATGAACCGC TGCGCATGGT GTTTTCGCAA CCGGTCGATC TTCAGGCGCT CGAAGCCGCA
CTTTCGATTA CGCCTGCGGC GCCTGGGACG CTTGAAGAAG CGCTTTTGCC CGATGGCACG
CAGATTGTGA CATATACGCC CACCATTGCG TGGCAGGCAG GAACCGCTTA CACCGTTGCG
CTTCCGGCGG CGCTGGCGGA TGGAACGGCG CTCCTGGCAC AACCGTATCG CTGGAGTTTC
GTTACCGCGC CAAAACCGGC ATTGATCGGA CGGTTCCCCG GCGAGGGGCA ACTGCTTCCG
CCTGGAAGCA ATGTGCGCCT GATCTTCAGC ACTCCGGTCG ATGCCGAGGC GCTGCGCGCA
CAACTGCGTA TCGATCCGCC GGTCGATGCC CTGCGTGTGA CCACCAACGA TGGTGAAGCG
CGCATCGATG CATCGCTCCA GGCGGCAACG CTGTATACCA TCACCATCCC GGCATCGCTC
ACCGATCGTG CCGGCGTTGC GCTCGAACGT GACTATCAGG TGCGTTTTTT TACGGCGCCT
GCTGTACCAT CGTTGACCCT GCCAGAGGCG ACCGGGCGTG TGATCCAGGC GCTTCCCGAT
CAGACCATTG GCGTGTTGAC GCGACGGACG AATCTCTCGG AATTGCGGCT GGCGCTCTAT
CGGCTCGATG AGGCCACGCT GCTGCGCGCC TTGAGTTTTA GCGACTCGGA ATGGACTGCC
TTTGAGCCGT CACGCTACGG ACTGTCACTC TTGCGCTCAT GGTCCGAACC ACTCGCCGAT
CCGCTCAATA CAACGGTCGA ATCGAACGTG ACGGTGACGC TCGACGATGG CTCGCCGCCG
CCATCGGGCA TCTACTTCCT GCGCCTCCGC ACGCCCGAAC GCGCAGGAAC CGGCGTCATT
CTCGTTGTCT CGCGCGCTGC GCTCTCGCTT CAGGTCATTG GGCAACGCGC AATTGTGTGG
ACAACGGATA CGGTCAGCGC AACGGTGATC CCTGATGCGC CGCTTGCACT TTACCGTCAA
GGTTCACCGG TCGCCGTCGG GCGCAGCGAT GACCGCGGGG TGTGGACCAT CGATTTGAGC
AGTATGAATC CGCGCGATCT TGTGGCGATC AGCAGCGATC TGTCCACATT CGCTGCATTG
GAAACGCCAC CACCGACCGC GCCTGCGCCG CGCCTGCGCA TTGTTCTGGC AACCGACCGC
ACTGTCTATT CGCCAGGCGC ACAGGTATCG ATCCGCGGTT TTGCCCGTCA GGTTGACGGG
CAATCGTTCG AACCGCCAAC GCCAGGATTG CCGCTCTCTC TTGAAGTGCG CGATCCATCC
GGTCGCACAG TGCAGAAGCG GATCACCCTC GATGGAACAG GCGTCTTTGA GACAACGTTG
CCGCTTTCAG ACAATGCGCA ACCCGGTATC TATCGTGTTT CGACACCGCA GGATACAGAC
AGCACACTTG CATTTTATGT CGATGAATCG ATGCCGCTCC GTGTGACGGT CGCCAGGGAT
CACAACAACG ATGTGCTGGT AACCGTGCGC ACCCCGGAAG GACTGCCGGT TGCCGGTGCC
GATGTCGCCT GGACTATCGA CTGCGAACCG CTGTCCCCTC CAATGAATGG CGAGATCGTG
TTCGGCGCAG CCGACCCTCC ACCGCCGTTG TCGGGCACAG GAGTAGCCGA TAGCAACGGC
GTGCTGGTGA TTGCCACACC CTCTCTATCG CCTGCTTCGT TCTATCGCTA CCGCTTCCGC
GCCCGGGTTG CCGAACCATA TGGACCTGCA ATCACTATCG AACGACTGCT CGAAACGCCG
CCTGCGCCAC TCGTCGGGCT TCGCACGCTG TCGTCGATTG TCCGTGTGGG GACGCCAGCG
AGTGTCGAAG TCATCACACT TATCGGCGAT CAGCCGCTCG CCGCACAGCG TGTGCAGATC
GAGGCGACGC TGCTCAATGG CGAGGCATCC AATGACGCTG CGGCATCGCC TGTGGATCGA
CCGATTCTGA GTCGTGTGGT CGAGACCGAC AGCGATGGAA GAGCGATATT GAGCGTACCG
CTTCCGGCGC CTGGCGTCTA TCGGGTGCGC GCTTCACTCG ACAGCGTCGC TCAGGCGACT
CCGCCAACCG ACCTTATTCT GCGCGCATAC CAGCCTGGCT TTACCGATTG GCGCGGAGGG
CAGCAAGGAA CGATGCTGCT GACCGACCGC CGTCAGTATC GACCAGGCGA TACGGCGCTG
CTCCTGCCGC TCACGGCGCT TCCTGAAGGA CCAGCATTGC TGACCGTCCA CAGCAGTTCG
GGAGATGTCC GGGAAGAATT GCGCACCCTG CGCGCCGGTG AACCGATGAC ATTGACCCTG
ACGCCTGATG ATGCGCCTGG TGTTTGGGTG ACATTGACGC CTGCACTACG CCTGCCCGTT
CAACACCCCT TGCAGGTCGA TCTACCGGTT GTAGCGGTGG ACACGTCGCT GTCACTCACG
CTGACGACTG ATGCACAAAC GTATACACCC GATACGAACG CCACACTCAC ACTTACCGTC
ACCAATGCAA ACGACACTCC GACACCGGTC GATACGCTGG TACGGATTGT GACCGATGAC
GCTGAGATGC AGCAAACGGT TGTATGGCGT GTAGAACGAA CGAACGACGC CGGTGTGCTG
CGCATCAATG CGCGGTTGCC GCGTTCTCCC GGCATCGTTC CGGTGGATGT GTGGGTTGCC
GGTGAACGCG GTATCGGCAG AGTCAGCACG CGATTGACCG TCGTTCAACC GGTCGCCGCA
CAGATTATTG CGCCGCCGTT TGCGCGCGCT GGCGATCAGA TCGATACCCG CATTCGTCTC
ACGACAACCG ACGGCATAAC GCGAGAAACC AGCGTCGCAC TGCGCCTCCC TGATGGAACA
TCGAGCGTTC AGATCGTCAC AATTCCAGCG ACAGGCGTGA CGTCCCTCCC GTTCACGCTG
CGCGCGCCTG ATGCTATTGC CCTGGAGGTG CAGGCGACGG TGACAACCGG CGCAGCATTC
AGCGAGACAG TGCGCACGAC GCTTCCTGTA CGCTCGCCTG CCGCAACGAT CAGCAGCTCA
GGCGGCGCTC TTGTGACCGA TCGCTTCGAG ACGCAGATCG CAAGACCCGG TGATGGTTCG
GCGGAGTGGG GGTGGCTCGA TCTCGCCGTT GCACCGTCGC TGAAGGGGCT GTCGCTGGAA
CAGTCGCGCG CGTTCATGGC ACTTCCTGAC CGACATTCGC TCGAAGATGC GGCTATCATC
CTGATGGCAG CGTCGCTGAC AGAGGCGCGC CCGGATGTGC AGGCAGCAAC CAAACACCTG
GTAGAACGGC AGACAAACGA TGGGGGATGG TCATGGCGCT TCGGTGGCGG GTCAAATCCA
GCCGTCACTG CCATCGTGCT CGAGGCGCTC GCCGACGCAA AGGCAGCCGG TGTCCCGTTG
CCCGACGGGT CGCTTGAACG CGCGACAACC CTGGCGCTTC GTCTTGTGCG TGATCCTGAT
GTTCCGCTCG AAACGCGCTT CTGCCTGAGT TCTGCGCTGA CCCAACTCGG TGTTGTTGAG
TCGTTGCTTC CGCGTGACTG GGACGAGGGT GAACTGGACG TTCACGGGAT GGCATGCCAT
CTGTTCGTGT TGCCGCCGGA TCAGGCGCGC GTCAGCCCGA CGCTGCCCCG GCTCATCAGT
CTGGCGCAGC GCGCAGATGG AAAGGCGTGG TGGAGCGCAC CGCCCGATAG TGCGTTCCCT
TATGACGACG TAGCCACAAC GGCGCTGGCG ATGCGTGCGA TCCATCACGC ATCCCCGCGC
CATCCGCTGG CAACCGACGC CACGCGTTGG CTGATCGCTC GGATGACGCC AACCGGATGG
GGCGACGCGC TGACAACTGC GCGCGTTGTG CAGGCGCTGC GCGTCATTAT GCCAGCAGAT
GCGTCGGCAT CCGTTACCCT CTCCCTGAAT GGCGCACCCA TCACCCTGCC CGATACGCCG
GATGCCACGC TGCGCCTGGT TCCCATCCCC ATTGCCGATC TGCGCCCAAC CAACACGCTG
GTTGTCACCA GTAGCGGCGC GCCGGCGCTG GTCGCGTGGC AGACGACACA CGCGGTCAGC
GCTCCCCTCT CATTCGAAGG CGTCGGTCTG CTCCGCGAAT ATCTGGACCC ACGCACTGGT
GCGCCGATCA ATCCGGTGGG GTTGAAACAG GGACAGTTGG TGCAGGTGCG GCTGACACTC
GCAGCATTCC ATGAACGACG CTTTGTGTCG GTGCGTGATG CGCTTCCCGC AGGATTTGCG
CTGGTCGAGA CCGACGCCGG TTCGATTTTC CAGATTGACG CTTTTGACGA CCGGATTGAA
ATTGCCGCTG AAACATTGCC GCCTGGCATT CACCAATACA CCTATCTGGC GCGTGCGTCT
GTTGCGGGAG CATACGCTGC ACCGCCACCC AAATTGATCC TGCCTGGAGG TCGCGCATTC
ACCGGCGTGG CGACGGCAAA CATGGTGCGG ATCGATGCGG CCGCACGAGC ATGA
 
Protein sequence
MRVLRQIGVI WTGALLVIVV ASLALPGARF LILPALSAVP AVVSVSPPDG ARSVSPRAVL 
TIQFSASMNP PSVEHALRID PATDVVYGWD LDRTTLTITP TTALQAGVRY HITVAETALS
RYFRPLAQPF TFAFETAPPA AITALLPRDG SVDVALDTLV GVRFSRPIVS LDALARSATL
PALRSDPPLA GSVTWLDPAT LLFRPSEPLR PGVRYTFSLD PDLTDQSGVP LGRAYTWSFT
TLAPMVREVS PPPNARLVGP YEPLRMVFSQ PVDLQALEAA LSITPAAPGT LEEALLPDGT
QIVTYTPTIA WQAGTAYTVA LPAALADGTA LLAQPYRWSF VTAPKPALIG RFPGEGQLLP
PGSNVRLIFS TPVDAEALRA QLRIDPPVDA LRVTTNDGEA RIDASLQAAT LYTITIPASL
TDRAGVALER DYQVRFFTAP AVPSLTLPEA TGRVIQALPD QTIGVLTRRT NLSELRLALY
RLDEATLLRA LSFSDSEWTA FEPSRYGLSL LRSWSEPLAD PLNTTVESNV TVTLDDGSPP
PSGIYFLRLR TPERAGTGVI LVVSRAALSL QVIGQRAIVW TTDTVSATVI PDAPLALYRQ
GSPVAVGRSD DRGVWTIDLS SMNPRDLVAI SSDLSTFAAL ETPPPTAPAP RLRIVLATDR
TVYSPGAQVS IRGFARQVDG QSFEPPTPGL PLSLEVRDPS GRTVQKRITL DGTGVFETTL
PLSDNAQPGI YRVSTPQDTD STLAFYVDES MPLRVTVARD HNNDVLVTVR TPEGLPVAGA
DVAWTIDCEP LSPPMNGEIV FGAADPPPPL SGTGVADSNG VLVIATPSLS PASFYRYRFR
ARVAEPYGPA ITIERLLETP PAPLVGLRTL SSIVRVGTPA SVEVITLIGD QPLAAQRVQI
EATLLNGEAS NDAAASPVDR PILSRVVETD SDGRAILSVP LPAPGVYRVR ASLDSVAQAT
PPTDLILRAY QPGFTDWRGG QQGTMLLTDR RQYRPGDTAL LLPLTALPEG PALLTVHSSS
GDVREELRTL RAGEPMTLTL TPDDAPGVWV TLTPALRLPV QHPLQVDLPV VAVDTSLSLT
LTTDAQTYTP DTNATLTLTV TNANDTPTPV DTLVRIVTDD AEMQQTVVWR VERTNDAGVL
RINARLPRSP GIVPVDVWVA GERGIGRVST RLTVVQPVAA QIIAPPFARA GDQIDTRIRL
TTTDGITRET SVALRLPDGT SSVQIVTIPA TGVTSLPFTL RAPDAIALEV QATVTTGAAF
SETVRTTLPV RSPAATISSS GGALVTDRFE TQIARPGDGS AEWGWLDLAV APSLKGLSLE
QSRAFMALPD RHSLEDAAII LMAASLTEAR PDVQAATKHL VERQTNDGGW SWRFGGGSNP
AVTAIVLEAL ADAKAAGVPL PDGSLERATT LALRLVRDPD VPLETRFCLS SALTQLGVVE
SLLPRDWDEG ELDVHGMACH LFVLPPDQAR VSPTLPRLIS LAQRADGKAW WSAPPDSAFP
YDDVATTALA MRAIHHASPR HPLATDATRW LIARMTPTGW GDALTTARVV QALRVIMPAD
ASASVTLSLN GAPITLPDTP DATLRLVPIP IADLRPTNTL VVTSSGAPAL VAWQTTHAVS
APLSFEGVGL LREYLDPRTG APINPVGLKQ GQLVQVRLTL AAFHERRFVS VRDALPAGFA
LVETDAGSIF QIDAFDDRIE IAAETLPPGI HQYTYLARAS VAGAYAAPPP KLILPGGRAF
TGVATANMVR IDAAARA